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NOVEL PHOSPHODIESTERASE INTERACTING PROTEINS 

5 ACKNOWLEDGMENT OF GOVERNVmNT S^TTppnpr 

This invention was made with Government support under Grant No. HD20788 
awarded by the National Institutes of Health. The Government has certain rights m this 
invention. 

10 INTRODUCTION 

Field of the Tnventinn 

The field of the invention is cyclic nucleotide phosphodiesterases, particularly cAMP 
phosphodiesterases. 
Background of the Tnventinn 

15 Cyclic nucleotide phosphodiesterases are a class of enzymes that catalyze the 

hydrolysis of phosphodiester bonds in cyclic nucleotides, e.g. cAMP. Cyclic nucleotides are 
important second messengers that regulate and mediate a number of cellular responses to 
extracellular signals, such as hormones, light and neurotransmitters. Since cyclic nucleotide 
phosphodiesterases modulate the concentration of cycUc nucleotides, these enzymes play a 

20 significant role in signal transduction. There are at least ten diflferent classes of cyclic 

phosphodiesterases, seven of which are: (I) Ca(2+)/cahnoduhn-dependent PDEs; (H) cGMP- 
stimulated PDEs; (EI) cGMP-inhibited PDEs; (IV) cAMP-specific PDEs; (V) cGMP-spedfic 
PDEs; (VI) photoreceptor PDEs; and (VII) high-affinity, cAMP-specific PDEs. Because of 
their role in signal transduction, cychc nucleotide phosphodiesterases have been pxirsued as 

25 therapeutic or pharmacologic targets in the modulation of a variety of distinct physiological 
processes. 

cAMP phosphodiesterase inhibitors hold great promise as therapeutic agents for use in 
the treatment of inflammation. Specifically, data indicates that these types of inhibitors are as 
effective, or even more effective, than adrenal steroids in suppressing most fiinctions of 
30 infl a mm a t ory cells, including: migration, adhesion and secretion of cytokines. Specific cAMP 
phosphodiesterase inhibitors that have been studied inchide: rolipram, theophylline, and the 
like. In addition, research is ongoing to identify new cAMP phosphodiesterase inhibitors. 
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Despite their promise as anti-inflammatory therapeutic agents, cAMP- 
phosphodiesterase inhibitors identified to date have demonstrated significant toxic side effects 
that have limited to their generalized use in the treatment of mflammation. 
" ' "As such, there is continued interest in the identification of new, more selective cAMP 
phosphodiesterase inhibitors for potential use as anti-inflammatory therapeutic agents. These 
efforts have employed recombinant phosphodiesterases for automated screening of candidate 
agents. Use of recombinant phosphodiesterases in screening applications has, however, been 
problematic as such recombinant enzymes have altered conformation as compared to their 
naturally occurring counterparts, which affects the interaction with potential inhibitors and 
thereby confounds the results that are obtained. As such, the screening results obtained by 
using such recombinant proteins are problematic. 

Therefore, there is much interest in the further elucidation of the conformation of 
phosphodiesterases and other factors that may modulate the interaction of these enzymes with 
inhibitors. 
Relevant Literature 

The role of c AMP phosphodiesterases in inflammatory processes is reviewed m 
Torphy, Am. J. Respir. Crit. Care Med. (1998) 157:351-370. See also Houslay et al,. Adv. 
Pharmacol (1 998) 44: 225-342 and Spina et al.. Adv. Pharmacol (1998) 44: 33-89, as well as 
U.S. Patent No. 5,798,373, the disclosure of which is herein incorporated by reference. 

SUMMARY OF THR TNVFNTTniy 
Nucleic acid compositions encoding phosphodiesterase interacting proteins, e.g. 
myomegalin, as well as the polypeptide compositions encoded thereby, are provided. Also - 
provided are complexes of the subject phosphodiesterase interacting protein with a 
phosphodiesterase enzyme. The subject polypeptide and nucleic acid compositions, as well as 
complexes thereof, find use m a variety of applications, including research, diagnostic, and 
therapeutic agent identification and screening applications, as well as in therapeutic 
applications. 



BRIEF DRSCRIPTTON OF T HE HGURES 
Figure 1 provides the amino acid sequence of rat myomegalm. 
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Figure 2 provides the cDNA sequence of a clone having an open reading frame 
encoding the myomegalin protein having the amino acid sequence of Figure 1. 

Figure 3 provides the nucleic acid sequence from the first met to the first stop codon 
in the sequence of Figure 2. 

Figure 4 provides the nucleic acid sequence of human myomegalin. 

Figure 5 provides the amino acid sequence of human myomegalin. 

Figure 6 provides the amino acid sequence of rat M14 protein. 

DETAILED DESCRrPTTQN OF THF. TNyPNTy p^j 
Novel phosphodiesterase interacting proteins, particularly myomegalin, as well as 
nucleic acid compositions encoding the same, are provided. Also provided are complexes of 
the subject protems and phosphodiesterases. The subject polypeptide and nucleic acid 
compositions find use in a variety of applications, including research, diagnostic, and 
therapeutic agent identification and screening applications, as well as in therapeutic 
applications. 

Before the subject invention is described fimher, it is to be understood that the 
invention is not limited to the particular embodiments of the invention described below, as 
variations of the particular embodiments may be made and still fall within the scope of the 
appended claims. It is also to be understood that the terminology employed is for the purpose 
of describing particular embodiments, and is not intended to be limiting. Instead, the scope of 
the present invention will be established by the appended claims. 

In this specification and the appended claims, the singular forms "a," "an," and "the" 
include plural reference unless the context clearly dictates otherwise. Unless defined 
otherwise, all technical and scientific terms used herein have the same meaning as commonly 
understood to one of ordmary skill in the art to which this mvention belongs. 

Nucleic Acid CoMPosmoNs 

Nucleic acid compositions encoding phosphodiesterase (PDE) interacting proteins, as 
well as fi-agments thereof, are provided. The subject nucleic acid compositions encode 
proteins that interact with a phoshodiesterase enzyme, modulate its conformation and direct 
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its location in a ceU, In other words, the proteins encoded by the subject nucleic acid 
compositions are those that target a (PDE) to a particular subceUuiar compartment and alter 
the function and/or properties of the PDE. Of particular interest are nucleic acid compositions 
which encode proteins that bind to a PDE IV isoenzyme, including PDE4A, PDE4B, PDE4C 
PDE4D, and the like. 

By nucleic acid composition is meant a composition comprising a sequence of DNA 
having an open reading frame tiiat encodes a PDE interacting polypeptide, i.e. a gene 
encoding a polypeptide that interacts witii a PDE (e.g. binds to and targets a PDE), and is 
capable, under appropriate conditions, of being expressed as a PDE interacting polypeptide. 
Also encompassed in tiiis term are nucleic acids tiiat are homologous, substantially similar or 
identical to the nucleic acids encoding PDE interacting polypeptides or proteins. Thus, the 
subject invention provides genes encoding mammalian PDE interacting proteins, such as 
genes encoding human PDE interacting polypeptides and homologs thereof, as weU as non- 
human mammaUan PDE interacting polypeptides and homologs thereof, e.g. rat and mouse 
proteins. 

Of particular interest is a nucleic acid composition encoding a myomegalin protein, 
particularly a mammalian myomegalin protein, described in greater detail infra, or a fragment 
or homolog tiiereof Specific nucleic acid compositions of interest include: polynucleotides 
encoding a rat myomegalm protein, such as polynucleotides having a nucleotide sequence 
found in SEQ ID NOs: 1 or 3, including polynucleotides in which the entire sequence is tiie 
same as the sequence of SEQ ID NOs. 1 or 3; and polynucleotides encoding human 
myomegalin protein, such as polynucleotides having a nucleotide sequence found in SEQ ID 
NO:04, including polynucleotides in which tiie entire sequence is the same as the sequence of 
SEQ ID NOs. 04, as well as those in which the entire sequence is the same as tiie sequence of 
an ORF found in SEQ ID NO:04. 

Also of interest are nucleic acid compositions encoding an Ml 4 polypeptide, 
described in greater detail infra, or a fragment or homolog tiiereof. Specific nucleic acid 
compositions of interest include polynucleotides encoding a rat M14 polypeptide, such as 
polynucleotides encoding an M14 polypeptide having the amino acid sequence set forth in 
SEQ ID NO:08. Polynucleotides encoding M14 homologs, and polynucleotides encoding 
PDE-interacting Augments of an Ml 4 polypeptide, are also of interest. 
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Also of interest are nucleic acid compositions encoding a huntingtin-interacting 
protein, e.g., HIPl. Specific nucleic acid compositions of interest include a polynucleotide 
encoding a human HBPl polypeptide, including, for example, a polynucleotide as disclosed in 
GenBank Accession No. U79734. 
5 The source of homologous genes to those specifically listed above may be any 

mammali an species, e.g., primate species, particularly human; rodents, such as guinea pigs 
and mice, canines, felines, bovines, ovines, equines, yeast, nematodes, etc. Between 
mamm al i a n species, e.g., human and mouse, homologs have substantial sequence similarity, 
e.g, at least 75% sequencft identity, usually at least 90%, more usually at least 95% between 

10 nucleotide sequences. Sequence similarity is calculated based on a reference sequence, which 
may be a subset of a larger sequence, such as a conserved motif, coding region, flanking 
region, etc. A reference sequence will usually be at least about 18 nt long, more usually at 
least about 30 nt long, and may extend to the complete sequence that is being compared. 
Algorithms for sequence analysis are known in the art, such as BLAST, described in Altschul 

15 eiaL (1990), J. MoL BioL 215:403-10. Unless stated otherwise herein, all sequence identity 
figures provided m this application are determined using the BLAST program at default 
settings (e.g. w=A\ 7=17). The sequences provided herein are essential for recognizing genes 
encoding PDE interacting protein-related and homologous polynucleotides in database 
searches. 

20 Nucleic acids encoding the subject PDE interacting proteins and polypeptides of the 

subject invention may be cDNAs or genomic DNAs, as well as fi-agments thereof Also 
provided are genes comprising the subject nucleic acid compositions, where the term "gene" 
shall be intended to mean the open reading fi-ame encoding specific PDE interacting proteins 
and polypeptides, and introns, as well as adjacent 5' and 3' non-coding nucleotide sequences 

25 involved in the regulation of expression, up to about 20 kb beyond the coding region, but 

possibly further in either direction. The gene may be introduced into an appropriate vector for 
extrachromosomal maintenance or for integration into a host genome. 

The term "cDNA" as used herein is intended to include all nucleic acids that share the 
arrangement of sequence elements found in native mature mRNA species, where sequence 

30 elements are exons and 3 ' and 5 ' non-coding regions. Normally mRNA species have 

contiguous exons, with the intervening introns, when present, being removed by nuclear RNA 
splicing, to create a continuous open reading fi^e encoding an PDE interacting protein. 
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A genomic sequence of interest comprises the nucleic acid present between the 
initiation codon and the stop codon, as defined in the listed sequences, inchiding all of the 
introns that are normally present in a native chromosome. It may further include the 3 ' and 5' 
untranslated regions found in the mature mRNA. It may further include specific 
transcriptional and translational regulatory sequences, such as promoters, enhancers, etc, 
including about 1 kb, but possibly more, of flanking genomic DNA at either the 5' or 3' end 
of the transcribed region. The genomic DNA may be isolated as a fi-agment of 100 kbp or 
smaller; and substantially fi-ee of flanking chromosomal sequence. The genomic DNA flanking 
the coding region, either 3' or 5', or internal regulatory sequences as sometimes found in 
introns, contains sequences required for proper tissue and stage specific expression. 

The nucleic acid compositions of the subject invention may encode all or a part of the 
subject PDE interacting proteins and polypeptides, described in greater detail infra. Double 
or single stranded fi-agments may be obtained fi-om the DNA sequence by chemically 
synthesizing oligonucleotides in accordance with conventional methods, by restriction enzyme 
digestion, by PGR amplification, etc. For the most part, DNA fragments will be of at least 
1 5 nt, usually at least 1 8 nt or 25 nt, and may be at least about 50 nt. 

The genes of the subject invention are isolated and obtained in substantial purity, 
generally as other than an intact chromosome. Usually, the DNA will be obtained 
substantially free of other nucleic acid sequences that do not include a sequence encoding a 
PDE interactmg protein or fragment thereof, generally being at least about 50%, usually at 
least about 90% pure and are typically "recombinant," i.e, flanked by one or more nucleotides 
with which it is not normally associated on a naturally occurring chromosome. 

In addition to the plurality of uses described in greater detail in following sections, the 
subject nucleic acid compositions find use in the preparation of all or a portion of the PDE 
interacting polypeptides, as described below. 

Polypeptide Compositions 

Also provided by the subject invention are PDE interacting proteins and polypeptides, i.e. 
proteins and polypeptides that are capable of binding to and modulating PDEs, specifically 
cAMP-PDEs, and more particularly cAMP-PDE4 isofonns, such as PDE4A, PDE4B, PDE4C, 
PDE4D, and the like. 
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The term polyeptide composition as used herein refers to both the full length proteins 
as well as portions or fragments thereof. Also included in this term are variations of the 
naturally occurring proteins, where such variations are homologous or substantially similar to 
the naturally occurring protein, as described in greater detail below, be the naturally occurring 
5 protein the human protein, rat protein, or protein from some other species which naturally 
expresses an PDE mteracting protein, usually a mammalian species. In the following 
description of the subject invention, the term PDE mteracting protein is used to refer not only 
to the himiau form of such proteins, but also to homologs thereof expressed in non-human 
species, e.g. murine, rat and other mammalian species. 
10 The subject PDE proteins are, in their natural environment, capable of modulating the 

form/function of PDEs, as well as targeting PDEs to specific subcellular compartments within 
a cell. In many embodhnents, the subject PDE interacting proteins serve as PDE anchoring 
proteins. 

In many embodiments, the subject proteins are characterized by the presence of one or 
1 5 more coiled domains and leucine zippers. Furthermore, in certain embodiments, e.g. certain 
rat myomegalin proteins, the subject proteins have a region of high homology with 
Drosophila centrosomin, whereby high homology is meant at least about 30, usually at least 
about 40 % sequence identity. 

In many embodiments, the proteins range in length from about 1500 to 3000, usually 
20 from about 1600 to 2800 and more usually from about 1650 to 2600 amino acid residues, and 
the projected molecular weight of the subject proteins based solely on the number of amino 
acid residues in the protein ranges from about 150 to 320, usually from about 160 to 300 
kDa, where the actual molecular weight may vary depending on the amount of glycolsylation, 
if any, of the protein and the apparent molecular weight may be considerably less (40 to 50 
25 kDa) due to SDS binding on gels. On other embodiments, the length of the proteins may be 
much smaller, e.g. as in the case of splice variants or post translated products, where the 
length in these proteins may be as short as 40%, usually no shorter than about 50% of the 
above lengths. 

Of particular interest in many embodiments are proteins that are non-naturally 
30 glycosylated. By non-naturally glycosylated is meant tiiat the protein has a glycosylation 
pattern, if present, which is not the same as the glycosylation pattern found in the 
corresponding naturally occurring protein. For example, a human phosphodiesterase binding 
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protein of the subject invention and of this particular embodiment is characterized by having a 
glycosylation pattern, if it is glycosylated at aU, that differs from that of naturally occurring 
human PDE bindmg protein. Thus, the non-naturally glycosylated PDE interacting or binding 
proteins of this embodiment include non-glycosylated PDE interacting proteins, i.e. proteins 
having no covalently bound glycosyl groups. 

A PDE interacting protein of the subject invention of particular interest is 
myomegaUn, particularly mammaUan myomegaUn and more particuarly, rat or human 
myomegalin. In many embodiments, mammaUan myomegaUn ranges in length from about 
2000 to 3000, usually from about 2700 to 2800 and more usually from about 2300 to 2600 aa 
residues. The projected molecular weight of these myomegaUn proteins based solely on the 
number of amino acid residues in the protein ranges from about 220 to 320, usually from 
about 220 to 300 and more usually from about 240 to 300 kDa, where the actual molecular 
weight may vary depending on the amount of glycolsylation, if any, of the protein and the 
apparent molecular weight may be considerably less (40 to 50 kDa) due to SDS binding on 
gels. Also of interest are mammaUan myomegaUn proteins that are shorter than those 
described above, where these shorter proteins could be spUce variants or the products of 
post-translational activity, and the Uke. 

Of particular interest in certain embodiments is the rat myomegaUn protein, where the 
rat myomegaUn protein of the subject invention has an amino acid sequence that is 
substantially the same as or identical to the sequence appearing as SEQ ID NO:02 infra and 
appearing in Figure 1 By substantially the same as is meant a protein having a sequence that 
has at least about 80%, usuaUy at least about 90% and more usually at least about 98% 
sequence identity with the sequence of SED ID NO:02. Also of particular interest is an 
approximately 65 kDa rat myomegaUn protein expressed in rat testis. Yet another protein of 
particular interest is the human myomegaUn protein of the subject invention which has an 
amino add sequence that is substantially the same as or identical to the sequence appearing as 
SEQ ID NO:05 irtfra and appearing in Figure 5. By substantially the same as is meant a 
protein having a sequence that has at least about 80%, usuaUy atjeast about 90% and more 
usually at least about 98% sequence identity with the sequence of SED ID NO:05. 

Another PDE interacting protein of the subject invention of particular interest is M14, 
particularly mammaUan MM, and more particulariy, rat or human Ml 4. In many 
embodiments, mammaUan M 14 ranges in length from about 1500 to about 2000, usuaUy from 
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about 1600 to about 1800, usually from about 1650 to about 1700, and more usually from 
about 1670 to about 1690 amino acid residues. The projected molecular weight of these M14 
polypeptides, based solely on the number of amino acid residues in the protein, ranges from 
about 150 to about 200 kDa, usually from about 160 to about 180 kDa, usually from about 
5 165 to about 170 kDa. Rat M14 protein has a mobility on SDS-PAGE of about 185 kDa. 
The actual molecular weight may vary depending on the amount of glycosylation or other 
post-translational modifications, if any, of the protein, and the apparent molecular weight may 
be considerably less (e.g. 40-50 kDa) due to SDS binding on gels. Also of interest are PDE- 
interacting fragments of the above-described Ml 4 proteins. 

10 Of particular interest in certain embodunents is a rat M14 protein, where the rat M14 

protein of the subject invention has an amino acid sequence that is substantially the same or 
identical to the sequence set forth in SEQ ID NO:08 and appearing in Figure 6. By 
substantially the same as is meant a protein having a sequence that has at least about 80%, 
usually at least about 90% and more usually at least about 98% sequence identity with the 

15 sequence of SED ID NO:08. Proteins homologous to rat M14 are also of interest, including, 
e.g., an Ese2L protein as described in Sengar et aL (\999)EMBOJ. 18:1159-1171. 

Also of interest are huntingtin interacting proteins, and PDE-interacting fragments, 
variants and homologs thereof In some embodiments, huntingtin interacting protein (HEP) is 
a human mPl protein having an amino acid sequence as disclosed in GenBank Accession No. 

20 U79734, The human HIPl protein is described in Kalchman et al. (1997) Nature Genetics 
16:44-53. 

In addition to the specific PDE interactmg proteins described above, homologs or 
proteins (or fragments thereof) from other species, i.e. other animal or plant species, are also 
provided, where such homologs or proteins may be from a variety of diflFerent types of 

25 species, usually mammals, e.g. rodents, such as mice, rats; domestic animals, e.g. horse, cow, 
dog, cat; and humans. By homolog is meant a protein having at least about 35 %, usuaUy at 
least about 40% and more usually at least about 60 % amino acid sequence identity with a 
specific PDE interacting protein as identified in: (a) SEQ ID NO: 02 and appearing in Figure 
1 ; or (b) SEQ ID NO:05 and appearing in Figure 5; or (c) SEQ ID NO:08 and appearing in 

30 Figure 6. 

The PDE interacting proteins of the subject invention (e.g. human myomegalin, rat 
myomegalin or homologs thereof) are present in a non-naturally occurring environment, e.g. 
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are separated from their naturally occurring environment. In certain embodiments, the subject 
protein is present in a composition that is enriched for the subject protein as compared to the 
protein in its naturally occurring environment. As such, purified PDE interacting protein is 
provided, where by purified is meant that PDE interactmg protein is present in a composition 
that is substantially fi-ee of non PDE interacting proteins, where by substantially free is meant 
that less than 90 %, usually less than 60 % and more usually less than 50 % of the 
composition is made up of non-PDE interacting proteins. 

In certain embodiments of interest, the PDE interacting protein is present in a 
composition that is substantially free of the constituents that are present in its naturally 
occurring environment. For example, a human PDE interacting protein comprising 
composition according to the subject invention in this embodiment will be substantially, if not 
completely, free of those other biological constituents, such as proteins, carbohydrates, lipids, 
etc., with which it is present in its natural environment. As such, protein compositions of 
these embodiments will necessarily differ from those that are prepared by purifying the protein 
from a naturally occurring source, where at least trace amounts of the protein's constituents 
will still be present in the composition prepared from the naturally occurring source. 

The PDE interacting protein of the subject invention may also be present as an isolate, 
by which is meant that the PDE interacting protein is substantially free of both non-PDE 
interacting proteins and other naturally occurring biologic molecules, such as 
oligosaccharides, polynucleotides and fragments thereof, and the like, where substantially free 
in this ins;tance means that less than 70 %, usually less than 60% and more usually less daan 
50 % of the composition containing the isolated PDE interacting protein is a non-PDE 
interacting protein naturally occurring biological molecule. In certain embodiments, the 
subject protein is present in substantially pure form, where by substantially pure form is meant 
at least 95%, usually at least 97% and more usually at least 99% pure. 

In addition to the naturally occurring proteins, polypeptides which vary from the 
naturally occurring proteins are also provided. By polypeptides is meant proteins having an 
amino acid sequence encoded by an open reading frame (ORF) of an gene according to the 
subject invention, described supra, including the M length protein and fragments thereof, 
particularly biologically active fragments and/or fragments corresponding to functional 
domains; and inchiding fusions of the subject polypeptides to other proteins or parts thereof. 
Fragments of interest will typically be at least about 10 aa in length, usually at least about 50 
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aa in length, and may be as long as 300 aa in length or longer, but will usually not exceed 
about 1000 aa in length, where the fragment will have a stretch of amino acids that is identical 
to the protein of SEQ ID NO:02, SEQ ID NO:05, or SEQ ID NO:08, or a homolog thereof; 
of at least about 10 aa, and usually at least about 15 aa, and in many embodiments at least 
5 about 50 aa in length. 

PREPARAnON OF PDE INTERACTING POLYPEPTIDES 

The subject PDE interacting proteins and polypeptides may be obtained from naturally 
occurring sources or synthetically produced. Whei e obtained from naturally occurring 
10 sources, the source chosen will generally depend on the species from which the PDE 

interactmg protein is to be derived, e.g. muscle tissue, heart tissue, brain tissue, testis tissue, 
and the like. 

The subject PDE interacting polypeptide compositions may be synthetically derived by 
expressing a recombinant gene encoding the PDE interacting protein, such as the 

15 polynucleotide compositions described above, in a suitable host. For expression, an 

expression cassette may be employed. The expression vector will provide a transcriptional 
and translational initiation region, which may be inducible or constitutive, where the coding 
region is operably linked under the transcriptional control of the transcriptional initiation 
region, and a transcriptional and translational termination region. These control regions may 

20 be native to the gene encoding the particular PDE interacting protein, or may be derived from 
exogenous sources. 

Expression vectors generally have convenient restriction sites located near the 
promoter sequence to provide for the insertion of nucleic acid sequences encoding 
heterologous proteins. A selectable marker operative in the e^qjression host may be present. 

25 Expression vectors may be used for the production of fusion proteins, where the exogenous 
fusion peptide provides additional fimctionality, i.e. increased protein synthesis, stability, 
reactivity with defined antisera, an enzyme marker, e.g. P-galactosidase, etc. 

Expression cassettes may be prepared conq)rising a transcription initiation region, the 
gene or fragment thereof, and a transcriptional termination region. Of particular interest is the 

30 use of sequences that allow for the expression of functional epitopes or domains, usually at 
least about 8 amino acids in length, more usually at least about 15 amino adds in length, to 
about 25 amino acids, and up to the complete open reading frame of the gene. After 
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introduction of the DNA, the cells containing the construct may be selected by means of a 
selectable marker, the cells expanded and then used for expression. 

The subject proteins and polypeptides may be expressed in prokaryotes or eukaryotes 
in accordance with conventional ways, depending upon the purpose for expression. For large 
5 scale production of the protein, a unicellular organism, such as E, coli, B. subtilis, 
S. cerevisiae, insect cells in combination with baculovirus vectors, or cells of a higher 
organism such as vertebrates, particularly manunals, e.g. COS 7 cells, may be used as the 
expression host cells. In some situations, it is desirable to express the subject proteins m 
; eukaryotic cells, where the protein will benefit from native folding and post-transkdonal 

10 modifications. Small peptides can also be synthesized in the laboratory. Polypeptides that are 
subsets of the complete protein sequence may be used to identify and investigate parts of the 
protein important for fiinction. 

Once the source of the protein is identified and/or prepared, e.g. a transfected host 
expressing the protein is prepared, the protein is then purified to produce the desired PDE 

1 5 interacting protein comprising composition. Any convenient protein purification procedures 
may be employed, where suitable protein purification methodologies are described in Guide to 
Protein Purification, (Deuthser ed.) (Academic Press, 1990). For example, a lysate may be 
prepared from the original source, e.g. naturally occurring cells or tissues that express a PDE 
interactmg protein or the expression host expressing the PDE interacting protein, and purified 

20 using HPLC, exclusion chromatography, gel electrophoresis, aflBnity chromatography, and the 
like. 

Uses of the Subject Polypeptide and Nucleic Acid CoMPosmoNS 

The subject polypeptide and nucleic acid compositions find use in a variety of different 
25 applications, including diagnostic, and therapeutic agent screening/discovery/preparation 
applications, as well as the treatment of disease conditions associated with PDE interacting 
protein activity. 

General Applications 

30 The subject nucleic acid compositions find use in a variety of applications, including: 

(a) the identification of PDE interacting protein gene homologs, e.g. myomegalin homologs; 

(b) as a source of novel promoter elements; (c) the identification of PDE interacting protein 
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expression regulatory factors; (d) as probes and primers in hybridization applications, e.g. 
PCR; (e) the identification of expression patterns in biological specimens; (f) the preparation 
of cell or animal models for PDE interacting protein function; (g) the preparation of m vitro 
models for PDE interacting protein function; etc. 

5 

Identification of hnryi^lngf^ 

Homologs of the PDE interacting protein gene, e.g. the myomegalin gene, or the 
M14 gene, are identified by any of a number of methods. A fi-agment of the provided cDNA 
may be used as a hybridization probe against a cDNA library fi-om the target organism of 

1 0 interest, where low stringency conditions are used. The probe may be a large Augment, or one 
or more short degenerate primers. Nucleic acids having sequence similarity are detected by 
hybridization under low stringency conditions, for example, at SOX and 6xSSC (0.9 M 
sodium chIoride/0.09 M sodium citrate) and remain bound when subjected to washing at 
55**C in 1 xSSC (0. 15 M sodium chloride/0.015 M sodmm citrate). Sequence identity may be 

15 determined by hybridization under stringent conditions, for example, at SO^^C or higher and 
O.lxSSC (15 mM sodium chloride/01.5 mM sodium citrate). Nucleic acids having a region 
of substantial identity to the provided sequences, e.g. allelic variants, genetically altered 
versions of the gene, etc., bind to the provided sequences under stringent hybridization 
conditions. By using probes, particulariy labeled probes of DNA sequences, one can isolate 

20 homologous or related genes. 

Identification of Nnve! Prnmnter FJempqti 

The sequence of the 5' flanking region may be utilized for promoter elements, 
including enhancer bindmg sites, that provide for regulation in tissues where the subject gene 
25 is expressed. The tissue specific expression is useful for determining the pattern of expression, 
and for providing promoters that mimic the native pattern of expression. Naturally occurring 
polymorphisms in the promoter region are useful for determining natural variations in. 
e?q)ression, particularly those that may be associated with disease. 

30 Identification of Expression Regulatory FflctnrQ 

Alternatively, mutations may be introduced into the promoter region to determine the 
effect of altering expression in experimentally defined systems. Methods for the identification 

-13- 



wo 00/27861 



PCT/i;S99/26860 



of specific DNA motifs involved in the binding of transcriptional factors are known in the art, 
e.g, sequence similarity to known binding motifs, gel retardation studies, eta For examples, 
see Blackwell ei al (1995), Mol Med. 1: 194-205; Mortlock et ai (1996), Genome Res, 
6:327-33; and Joulin and Richard-Foy (1995), Eur. J. Biochem. 232:620-626. 

The regulatory sequences may be used to identify cis acting sequences required for 
transcriptional or transiational regulation of expression of the subject gene, e.g. the 
myomegalin gene, especially in different tissues or stages of development, and to identify cis 
acting sequences and //•aw5-acting factors that regulate or mediate expression of the subjea 
gene. Such transcription or transiational control regions mxy be operably Unked to a gene of 
the subject invention in order to promote expression of wild type or altered PDE interacting 
protein, e.g. myomegalin, or other proteins of interest in cultured cells, or m embryonic, fetal 
or adult tissues, and for gene therapy. 

Probes and Primerf: 

Small DNA fragments are useful as primers for PGR, hybridization screening probes, 
etc. Larger DNA fragments, Le, greater than 100 nt are useful for production of the encoded 
polypeptide, as described in the previous sectioa For use in amplification reactions, such as 
PGR, a pair of primers will be used. The exact composition of the primer sequences is not 
critical to the invention, but for most applications the primers will hybridize to the subject 
sequence under stringent conditions, as known in the art. It is preferable to choose a pair of 
primers that will generate an amplification product of at least about 50 nt, preferably at least 
about 100 nt. Algorithms for the selection of primer sequences are generally known and are 
available in commercial software packages. Amplification primers hybridize to 
complementary strands of DNA, and will prime towards each other. 

Mentification of Expression Patterns in Biological Specimens 

The DNA may also be used to identify expression of the gene in a biological 
specimen. The manner in which one probes cells for the presence of particular nucleotide 
sequences, as genomic DNA or RNA, is well established in the literature. Briefly, DNA or 
mRNA is isolated from a cell sample. The mRNA may be amplified by RT-PGR, using reverse 
transcriptase to form a complementary DNA strand, followed by polymerase chain reaction 
amplification using primers specific for tiie subject DNA sequences. Alternatively, the mRNA 
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sample is separated by gel electrophoresis, transferred to a suitable support, e.g, 
nitrocellulose, nylon, etc., and then probed with a fragment of the subject DNA as a probe. 
Other techniques, such as oligonucleotide ligation assays, in situ hybridizations, and 
hybridization to DNA probes arrayed on a solid chip may also find use. Detection of mRNA 
hybridizing to the subject sequence is indicative of gene expression in the sample. 

The Preparation of PDE Tnteractinp Prnt^tn Mllta^lTff 

The sequence of a gene according to the subject invention, including flanking 
promoter regions and coding regions, may be mutated in various ways known in the art tc 
generate targeted changes in promoter strength, sequence of the encoded protein, etc. The 
DNA sequence or protein product of such a mutation will usually be substantially similar to 
the sequences provided herein, /.e. will differ by at least one nucleotide or amino add, 
respectively, and may differ by at least two but not more than about ten nucleotides or amino 
acids. The sequence changes may be substitutions, msertions, deletions, or a combination 
thereof Deletions may fiirther include larger changes, such as deletions of a domain or exon. 
Other modifications of interest include epitope tagging, e.g, with the FLAG system, HA, etc. 
For studies of subcelhilar localization, fiision proteins with green fluorescent proteins (GFP) 
may be used. 

Techniques for in vitro mutagenesis of cloned genes are known. Examples of 
protocols for site specific mutagenesis may be found in Gustin et ai (1993), Biotechniques 
14:22; Barany (1985), Ge/ie 37:111-23; Colicelli e/ a/. (198S)Mol Gen. Genet 199:537-9; 
and Prentki et al, (1984), Gene 29:303-13. Methods for site specific mutagenesis can be 
found in Sambrook et aL, Molecular Cloning: A Laboratory Marmal, CSH Press 1989, pp. 
15.3-15.108; Weinere/oi (1993), Gene 126:35-41; Sayerse/o/. (1992X Biotechniques 
13:592-6; Jones and Winistorfer (1992), Biotechniques 12:528-30; Barton etal, (1990), 
Nucleic Acids Res 18:7349-55; Marotti and Tomich (1989), Gene Anal. Tech. 6:67-70; and 
Zhu (1989), AnalBiochem 177: 120-4. Such mutated genes may be used to study structure- 
fimction relationships of PDE interacting proteins, or to alter properties of the protein that 
affect its fimction or regulation. 
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Prodwction of In Vivo Models of PDE Tnteracting Protein Fnnctinn 

The subject nucleic acids can be used to generate transgenic, non-human animals or 
site specific gene modifications in cell lines. Transgenic animals may be made through 
homologous recombination, where the normal PDE interacting protein gene locus is altered. 
Alternatively, a nucleic acid construct is randomly integrated mto the genome. Vectors for 
stable integration include plasmids, retroviruses and other animal viruses, YACs, and the like. 

The modified cells or animals are useful in the study of PDE interacting protein 
function and regulation. For example, a series of small deletions and/or substitutions may be 
made in the host's native PDE interacting protein gene to determine the role of diflFerent exons 
in cholesterol metabolism, e.g. cholesterol ester synthesis, cholesterol absorption, etc. 
Specific constructs of interest include anti-sense constructs which will block PDE mteracting 
protein expression, expression of dominant negative gene mutations, and over-expression of 
PDE mteracting protein genes. Where a particular genetic sequence is introduced, the 
introduced sequence may be either a complete or partial sequence of an PDE interacting 
protein gene native to the host, or may be a complete or partial sequence that is exogenous to 
the host animal, e,g., a human sequence. A detectable marker, such as lac Z, may be 
introduced into the locus, where upregulation of gene expression will resuh in an easily 
detected change in phenotype. 

One may also provide for expression of the gene or variants thereof in cells or tissues 
where it is not normally expressed, at levels not normally present in such cells or tissues, or at 
abnormal times of development. 

DNA constnicts for homologous recombination will comprise at least a portion of the 
gene native to the species of the host animal, wherein the gene has the desired genetic 
modification(s), and includes regions of homology to the target locus. DNA constructs for 
random integration need not include regions of homology to mediate recombination. 
Conveniently, markers for positive and negative selection are included. Methods for 
generating cells having targeted gene modifications through homologous recombination are 
known in the art. For various techniques for transfecting mammalian cells, see Keown et aL 
(\990\ MetkEmymoL 185:527-537. 

For embryonic stem (ES) cells, an ES cell line may be employed, or embryonic cells 
may be obtained fi-eshly fi-om a host, e.g. mouse, rat, guinea pig, etc. Such cells are grown on 
an ^propriate fibroblast-feeder layer or grown in the presence of leukemia inhibiting fartor 
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(LIF). When ES or embryonic cells have been transformed, they may be used to produce 
transgenic animals. After transformation, the cells are plated onto a feeder layer in an 
appropriate medium. Cells containing the construct may be detected by employing a selective 
medium. After sufl5cient time for colonies to grow, they are picked and analyzed for the 
occurrence of homologous recombination or integration of the construct. Those colonies that 
are positive may then be used for embryo manipulation and blastocyst injection. Blastocysts 
are obtained fi-om 4 to 6 week old superovulated females. The ES cells are trypsinized, and ' 
the modified cells are injected into the blastocoel of the blastocyst. After iiyection, the 
blastocysts are returned to each uterine horn of pseudopregnant females. Females are then 
allowed to go to term and the resulting oflFspring screened for the construct. By providing for 
a different phenotype of the blastocyst and the genetically modified cells, chimeric progeny 
can be readily detected. 

The chimeric animals are screened for the presence of the modified gene and males 
and females having the modification are mated to produce homozygous progeny. If the gene 
alterations cause lethality at some point m development, tissues or organs can be maintained 
as allogeneic or congenic grafts or transplants, or in in vitro culture. The transgenic animals 
may be any non-human mammal, such as laboratory animals, domestic animals, etc. The 
transgenic animals may be used in fimctional studies, drug screening, etc., e.g. to determine 
the effect of a candidate drug on PDE interacting binding protein activity and/or the 
enzymatic activity of the PDE/PDE interacting protein complex. 

Production of In Vitro Models of PDF Inte racting Protein Fnnrtinn 

One can also use the polypeptide compositions of the subject invention to produce in 
vitro models of PDE interacting protein fimction. In addition to the subject PDE interacting 
protein, such models will generally include at least a PDE as well as a cyclic nucleotide, and a 
means to monitor the activity of the enzyme in the presence of the PDE interacting protein, 
e.g, a labeled isotope, etc. 

Diagnostic Applications 

Also provided are methods of diagnosing disease states associated with PDE 
interacting protein activity, e.g. based on observed levels of PDE interacting protein or the 
expression level of the gene in a biological sample of interest. Samples, as used herein, include 
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biological fluids such as semen, blood, cerebrospinal fluid, tears, saliva, lymph, dialysis fluid 
and the like; organ or tissue culture derived fluids; and fluids extracted from physiological 
tissues. Also included in the term are derivatives and fractions of such fluids. The cells may 
be dissociated, in the case of solid tissues, or tissue sections may be analyzed. Alternatively a 
lysate of the cells may be prepared. 

A number of methods are available for determining the expression level of a gene or 
protein in a particular sample. Diagnosis may be performed by a number of methods to 
determine the absence or presence or altered amounts of normal or abnormal PDE mteracting 
protein in a patient sample. For example, detection may utilize staining of cells or histological 
sections with labeled antibodies, performed in accordance with conventional methods. Cells 
are permeabilized to stain cytoplasmic molecules. The antibodies of interest are added to the 
ceU sample, and incubated for a period of time sufficient to allow binding to the epitope, 
usuaUy at least about 10 minutes. The antibody may be labeled with radioisotopes, enzymes, 
fluorescers, chemihiminescers, or other labels for dkect detection. Alternatively, a second 
stage antibody or reagent is used to amplify the signal. Such reagents are well known in the 
art. For example, the primary antibody may be conjugated to biotm, with horseradish 
peroxidase-conjugated avidin added as a second stage reagent. Alternatively, the secondary 
antibody conjugated to a flourescent compound, e,g, fluorescein, rhodamine, Texas red, etc. 
Final detection uses a substrate that undergoes a color change in the presence of the 
peroxidase. The absence or presence of antibody binding may be determined by various 
methods, including flow cytometry of dissociated cells, microscopy, radiography, scintillation 
counting, e/c. 

Alternatively, one may focus on the expression of the gene. Biochemical studies may 
be performed to determine whether a sequence polymorphism in an coding region or control 
regions is associated with disease. Disease associated polymorphisms may include deletion or 
truncation of the gene, mutations that alter expression level, that affect the activity of the 
protein, etc. 

Changes in the promoter or enhancer sequence that may affect expression levels of 
the gene can be compared to expression levels of the normal allele by various methods known 
in the art. Methods for determining promoter or enhancer strengdi include quantitation of the 
expressed natural protein; insertion of the variant control element into a vector with a 
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reporter gene such as P-galactosidase, luciferase, chloramphenicol acetyltransferase, etc, that 
provides for convenient quantitation; and the like, 

A number of methods are available for analyzing nucleic acids for the presence of a 
specific sequence, e.g, a disease associated polymorphism. Where large amounts of DNA are 
available, genomic DNA is used directly. Alternatively, the region of interest is cloned into a 
suitable vector and grown in suflScient quantity for analysis. Cells that express the subject 
gene may be used as a source of mRNA, which may be assayed directly or reverse transcribed 
into cDNA for analysis. The nucleic acid may be ampUfied by conventional techniques, such 
as the polymerase chain reaction (PGR), to provide sufficient amounts for analysis. The use 
of the polymerase chain reaction is described in Saiki, etaL (1985), Science 239:487, and a 
review of techniques may be found in Sambrook, et aL Molecular rinning- A r ,;^|.^^ ^|^yy 
MamiaL CSH Press 1989, pp.l4.2B14.33. Alternatively, various methods are known in the 
art that utihze oligonucleotide ligation as a means of detecting polymorphisms, for examples 
see Riley et al. (1990), NucL Acids Res. 18:2887-2890; and Delahunty et aL (1996), Am, J, 
Hum, Genet 58: 1239-1246. 

A detectable label may be included in an amplification reaction. Suitable labels inchide 
fluorochromes, e,g, fluorescein isothiocyanate (FITC), rhodamine, Texas Red, phycoerythrin, 
allophycocyanin, 6-carboxyfluorescein (6-FAM), 2',7'-dimethoxy-4',5'.dichloro-6. 
carboxyfluorescein (JOE), 6-carboxy-X-rhodamine (ROX), 6-carboxy-2',4',7',4,7- 
hexachlorofluorescein (HEX), 5-carboxyfluorescein (5-FAM) or N,N,N\N'-tetramethyl-6. 
carboxyrhodamine (TAMRA), radioactive labels, e.g, ^^S, ^H; etc. The label may be a 
two stage system, where the ampUfied DNA is conjugated to biotin, haptens, etc, having a 
high affinity binding partner, e,g. avidin, specific antibodies, etc, where the-binding partner is 
conjugated to a detectable label. The label may be conjugated to one or both of the primers. 
Alternatively, the pool of nucleotides used in the amplification is labeled, so as to incorporate 
the label into the amplification product. 

The sample nucleic acid, e.g, amplified or cloned fi-agment, is analyzed by one of a 
number of methods known in the art. The nucleic acid may be sequenced by dideoxy or other 
methods, and the sequence of bases compared to a wild-type sequence. Hybridization with 
the variant sequence may also be used to determine its presence, by Southern blots, dot blots, 
etc. The hybridization pattern of a control and variant sequence to an array of 
oUgonucleotide probes inamobilized on a solid support, as described in US 5,445,934, or in 
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WO 95/35505, may also be used as a means of detecting the presence of variant sequences. 
Single strand conformational polymorphism (SSCP) analysis, denaturing gradient gel 
electrophoresis (DGGE), and heteroduplex analysis in gel matrices are used to detect 
conformational changes created by DNA sequence variation as alterations in electrophoretic 
mobility. Alternatively, where a polymorphism creates or destroys a recognition site for a 
restriction endonuclease, the sample is digested with that endonuclease, and the products size 
fractionated to determine whether the fragment was digested. Fractionation is performed by 
gel or capillary electrophoresis, particularly acrylamide or agarose gels. 

Screening for mutations may be based on the functional or antigenic characteristics of 
Uie protein. Protein truncation assays are useful in detecting deletions that may affect the 
biological activity of the protein. Various immunoassays designed to detect polymorphisms in 
tiie subject PDE interacting proteins may be used in screening. Where many diverse genetic 
mutations lead to a particular disease phenotype, functional protein assays have proven to be 
effective screening tools. The activity of the encoded protem may be determined by 
comparison with the wild-type protein- 
Diagnostic methods of the subject invention in which the level of expression is of 
interest will typically involve comparison of the PDE interacting protein nucleic acid 
abimdance of a sample of interest with that of a control value to determine any relative 
differences, where the difference may be measured qualitatively and/or quantitatively, which 
differences are then related to the presence or absence of an abnormal gene expression 
pattern. A variety of different methods for determining the nucleic acid abundance in a sample 
are known to those of skill in the art, where particular methods of interest include those 
described in: Pietu et al.. Genome Res. (June 1996) 6: 492-503; Zhao et al., Gene (April 24, 
1995) 156: 207-213; Soares . Curr. Opin. Biotechnol. (October 1997) 8: 542-546; Raval, J. 
Pharmacol Toxicol Metiiods (November 1994) 32: 125-127; Chalifqur et al., Anal. Biochem 
(February 1, 1994) 216: 299-304; Stolz & Tuan, Mol. Biotechnol. (December 19960 6: 225- 
230; Hong et al,. Bioscience Reports (1982) 2: 907; and McGraw, Anal. Biochem. (1984) 
1 43 : 298. Also of interest are the methods disclosed in WO 97/273 1 7, tiie disclosure of which 
is herein incorporated by reference. 
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Screening Assays 

The subject PDE interacting proteins and polypeptides find use in various screening 
assays designed to identify therapeutic agents. The screening assays may be designed to 
identify agents that modulate, e.g. inhibit or enhance, the activity of the PDE interacting 
5 protein directly and thereby modulate the activity of the particular PDE that depends on the 
presence of the PDE interacting protein for its fimction. Altmiatively, the assay may be 
designed to identify those agents that modify, e.g. enhance or inhibit, the activity of the PDE 
when present as a conq)lex with the PDE interacting protein. 

Of particular interest &re screening methods that provide for qualitative/quantitative 

10 measurements of a PDE enzyme activity in the presence of a particular candidate therapeutic 
agent and its PDE mteracting protein, as such screening methods are capable of identifying 
highly selective PDE modulatory, e.g. inhibitory, agents. For example, the assay could be an 
assay which measures the activity of a PDE interacting protein/enzyme complex in the 
presence and absence of a candidate inhibitor agent. In this preferred screening assay 

1 5 embodiment, the PDE interacting protein/PDE complex will generally be a naturally occurring 
complex, i.e. a complex between a cyclic nucleotide PDE and its naturally occurring PDE 
interacting protein partner. Of particular interest are complexes between a cAMP-PDEIV and 
a myomegalin protein. 

The screening method may be an in vitro or in vivo format, where both formats are 

20 readily developed by those of skill in the art. Depending on the particular method, one or 
more of, usually one of; the components of the screening assay may be labeled, where by 
labeled is meant that the components comprise a detectable moiety, e,g. a fluorescent or 
radioactive tag, or a member of a signal producing system, e.g. biotin for binding to an 
enzyme-streptavidin conjugate in which the enzyme is capable of converting a substrate to a 

25 chromogenic product. 

A variety of other reagents may be included in the screening assay. These include 
reagents like salts, neutral proteins, e.g. albumin, detergents, etc. that are used to facilitate 
optimal protein-protein binding and/or reduce non-specific or backgroimd interactions. 
Reagents that improve the efficiency of the assay, such as protease inhibitors, nuclease 

30 inhibitors, anti-microbiai agents, etc. may be used. Specific PDE activity assays of interest 
include those described in U.S. Patent Nos. 5,798,373 and 5,580,888, the disclosures of 
which are herein incorporated by reference. 
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A variety of different candidate agents may be screened by the above methods. 
Candidate agents encompass numerous chemical classes, though typically they are organic 
molecules, preferably small organic compounds having a molecular weight of more than 50 
and less than about 2,500 daltons. Candidate agents comprise functional groups necessary 
for structural interaction with proteins, particularly hydrogen bonding, and typically include at 
least an amine, carbonyl, hydroxyl or carboxyl group, preferably at least two of the fimctional 
chemical groups. The candidate agents often comprise cyclical carbon or heterocyclic 
structures and/or aromatic or polyaromatic structures substituted with one or more of the 
above functional groups. Candidate agents are also found among biomolecules including 
peptides, saccharides, fatty acids, steroids, purines, pyrimidines, derivatives, structural 
analogs or combinations thereof 

Candidate agents are obtained from a wide variety of sources including libraries of 
synthetic or natural compounds. For example, numerous means are available for random and 
directed synthesis of a wide variety of organic compounds and biomolecules, including 
expression of randomized oligonucleotides and oligopeptides. Alternatively, libraries of 
natural compounds in the form of bacterial, fimgal, plant and animal extracts are available or 
readily produced. Additionally, natural or synthetically produced libraries and compounds are 
readily modified through conventional chemical, physical and biochemical means, and may be 
used to produce combinatorial libraries. Known pharmacological agents may be subjected to 
directed or random chemical modifications, such as acylation, alkylation, esterification, 
amidification, etc, to produce structural analogs. 

PDE Interacting Protein Nucleic Acid and Polypeptide Therapeutic Compositions 

The nucleic acid compositions of the subject invention also find use as therapeutic 
agents in situations where one wishes to enhance the PDE mteracting protem activity in a 
host, e.g. in a m a mm a li a n host in which PDE interacting protein activity is sufficiently low 
such that a disease condition is present, etc. The PDE interacting protein genes, gene 
fi-agments, or the encoded proteins or protein fi*agments are useful in gene therapy to treat 
disorders associated with defects the PDE interacting protein gene expression. Expression 
vectors may be used to introduce the gene into a cell. Such vectors generally have convenient 
restriction sites located near the promoter sequence to provide for the insertion of nucleic 
acid sequences. Transcription cassettes may be prepared comprising a transcription initiation 
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region, the target gene or fragment thereof, and a transcriptional termination region. The 
transcription cassettes may be introduced into a variety of vectors, e.g, plasmid; retrovirus, 
e.g. lentivinis; adenovirus; and the like, where the vectors are able to transiently or stably be 
maintained in the cells, usually for a period of at least about one day, more usually for a 
period of at least about several days to several weeks. 

The gene or protein may be introduced into tissues or host cells by any number of 
routes, including viral infection, microinjection, or fusion of vesicles. Jet iiyection may also be 
used for intramuscular admmistration, as described by Furth et al. (1992), Anal Biochem 
205:365-368. The DNA may be coated onto gold microparticles, and delivered intradermally 
by a particle bombardment device, or "gene gun" as described in the literature (see, for 
example, Tang et al (1992), Nature 356: 152-154), where gold microprojectiles are coated 
with the DNA, then bombarded into skin cells. 

MEraoDs OF Modulating PDE Ii^teracting Protein AcTrvny in a Host 

Also provided are methods of regulating, including enhancing and inhibiting, PDE 
interacting protein activity in a host. Where the PDE interacting protem activity occurs in 
vivo in a host, an effective amount of active agent that modulates the activity, e.g. reduces the 
activity, of the PDE interacting protein in vivo (e.g. the activity of the naturally occurring 
PDE/interacting protein complex), is administered to the host. The active agent may be a 
variety of different compounds, including a naturally occurring or synthetic small molecule 
compound, an antibody, fragment or derivative thereof an antisense composition, and the 
like. 

Naturally occurring or synthetic small molecule con:^)ounds of interest inchide 
numerous chemical classes, though typically they are organic molecules, preferably small 
organic compounds having a molecular weight of more than 50 and less than about 2,500 
daltons. Candidate agents comprise functional groups necessary for structural interaction 
with proteins, particularly hydrogen bonding, and typically include at least an amine, carbonyl, 
hydroxyl or carboxyl group, preferably at least two of the functional chemical groups. The 
candidate agents often comprise cyclical carbon or heterocyclic structures and/or aromatic or 
polyaromatic structures substituted with one or more of the above fimctional groups. 
Candidate agents are also found among biomolecules including peptides, saccharides, fatty 
acids, steroids, purines, pyrimidines, derivatives, structural analogs or combinations thereof 
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Also of interest as active agent are antibodies that modulate, e.g. reduce, if not inhibit, 
PDE interacting protein activity in the host. Suitable antibodies are obtained by immnniy^ng a 
host animal with peptides comprising all or a portion of the subject proteins, such as found in 
the polypeptide compositions of the subject invention. Suitable host animals include mouse, 
5 rat sheep, goat, hamster, rabbit, etc. The origin of the protein immunogen may be mouse, 
human, rat, monkey etc. The host animal will generally be a different species than the 
mmiunogen, e.g. humian protein used to immunize mice, etc. 

The immunogen may comprise the complete protein, or fragments and derivatives 
thereof Preferred immunogens comprise all or a part of the PDE interacting protein, where 
1 0 these residues contain the post-translation modifications, such as glycosylation, found on the 
native protein. Immunogens comprising the extracellular domain are produced in a variety of 
ways known in the art, e.g. expression of cloned genes using conventional recombinant 
methods, isolation from HEC, etc. 

For preparation of polyclonal antibodies, the first step is immunization of the host 
1 5 animal with the immunogen, where the immunogen will preferably be in substantially pure 
form, comprismg less than about 1% contaminant. The immunogen may comprise complete 
PDE interacting protein, fragments or derivatives thereof To increase the immune response 
of the host animal, the protein or peptide may be combined with an adjuvant, where suitable 
adjuvants include alum, dextran, sulfate, large polymeric anions, oil & water emulsions, e.g. 
20 Freund's adjuvant, Freund's complete adjuvant, and the like. The unmunogen may also be 
conjugated to synthetic carrier proteins or synthetic antigens. A variety of hosts may be 
immunized to produce the polyclonal antibodies. Such hosts include rabbits, guinea pigs, 
rodents, e.g. mice, rats, sheep, goats, and the Uke. The immunogen is administered to the 
host, usually intradermally, with an initial dosage followed by one or more, usually at least 
25 two, additional booster dosages. Following immunization, the blood from the host will be 
collected, followed by separation of the serum from the blood cells. The Ig present in the 
resultant antisenmi may be further fractionated using known methods, such as anmionium salt 
fractionation, DEAE chromatography, and the like. 

Monoclonal antibodies are produced by conventional techniques. Generally, the 
30 spleen and/or lyn^h nodes of an immunized host animal provide a source of plasma cells. 
The plasma cells are immortalized by fusion with myeloma cells to produce hybridoma cells. 
Culture supernatant from individxial hybridomas is screened using standard techniques to 
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identify those producing antibodies with the desired specificity. Suitable animals for 
production of monoclonal antibodies to the human protein include mouse, rat, hamster, etc. 
To raise antibodies against the mouse protein, the animal will generally be a hamster, guinea 
pig, rabbit, etc. The antibody may be purified firom the hybridoma cell supematants or ascites 
5 fluid by conventional techniques, e.g. aflBnity chromatography using PDE-interacting protein 
. bound to an insoluble support, protein A sepharose, eta 

The antibody may be produced as a single chain, instead of the normal multimeric 
structure. Smgle chain antibodies are described in Jost et ai (1994) 269:26267-73, 
and others. DNA sequences encoding the variable region of the heavy chain and the variable 
1 0 region of the light chain are ligated to a spacer encoding at least about 4 amino acids of small 
neutral amino acids, including glycine and/or serine. The protein encoded by this fusion 
allows assembly of a functional variable region that retains the specificity and affinity of the 
original antibody. 

For in vivo use, particularly for injection into humans, it is desirable to decrease the 

1 5 antigenicity of the antibody. An immune response of a recipient against the blocking agent 
will potentially decrease the period of time that the therapy is eflFective. Methods of 
humanizing antibodies are known in the art. The humanized antibody may be the product of 
an animal having transgenic human immunoglobulin constant region genes (see for example 
International Patent Applications WO 90/10077 and WO 90/04036), Alternatively, the 

20 antibody of interest may be engineered by recombinant DNA techniques to substitute the 
CHI, CH2, CH3, hinge domains, and/or the fi'amework domain with the corresponding 
human sequence (see WO 92/02190). 

The use of Ig cDNA for construction of chuneric immunoglobulin genes is known in 
theart(Lhie/a/. (1987) EKA^ 84:3439 and (1987) LJtanunoL 139:3521). mRNAis 

25 isolated fi-om a hybridoma or other ceil producmg the antibody and used to produce cDNA. 
The cDNA of interest may be amplified by the polymerase chain reaction using specific 
primers (U.S. Patent nos. 4,683,195 and 4,683,202). Alternatively, a library is made and 
screened to isolate the sequence of interest. The DNA sequence encoding the variable region 
of the antibody is then fused to human constant region sequences. The sequences of himian 

30 constant regions genes may be found in Kabat et al (1991) Sequences of Proteins of 

Tmniiinril^grjca] Tnterest, N.LH. publication no. 91-3242. Human C region genes are readily 
available fi-om known clones. The choice of isotype will be guided by the desired eflfector 
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functions, such as complement fixation, or activity in antibody-dependent cellular 
cytotoxicity. Preferred isotypes are IgGl, IgG3 and IgG4. Either of the human light chain 
constant regions, kappa or lambda, may be used. The chimeric, humanized antibody is then 
expressed by conventional methods. 
5 Antibody fi'agments, such as Fv, F(ab)2 and Fab may be prepared by cleavage of the 

intact protein, e.g. by protease or chemical cleavage. Alternatively, a truncated gene is 
designed. For example, a chimeric gene encoding a portion of the F(ab)2 fi-agment would 
include DNA sequences encoding the CHI domain and hinge region of the H chain, followed 
by a translation^ stop zodon to yield the truncated molecule. 
10 Consensus sequences of H and L J regions may be used to design oligonucleotides for 

use as primers to introduce useful restriction sites into the J region for subsequent linkage of 
V region segments to human C region segments. C region cDNA can be modified by site 
directed mutagenesis to place a restriction site at the analogous position in the human 
sequence. 

15 Expression vectors inchide plasmids, retroviruses, YACs, EBV derived episomes, and 

the like. A convenient vector is one that encodes a fiinctionally complete hiiman CH or CL 
immimoglobulin sequence, with appropriate restriction sites engineered so that any VH or VL 
sequence can be easily inserted and expressed. In such vectors, splicing usually occurs 
between the splice donor site in the inserted J region and the splice acceptor site preceding 

20 the human C region, and also at the splice regions that occur within the human CH exons. 
Polyadenylation and transcription termination occur at native chromosomal sites downstream 
of the coding regions. The resulting chimeric antibody may be joined to any strong promoter, 
including retroviral LTRs, e.g. SV-40 early promoter, (Okayama et al. (1983) Mol. Cell. Bio 
3:280), Rous sarcoma virus LTR (Gonnan et al. (1982) FN A S 79:6777), and moloney 

25 murine leukemia virus LTR (Grosschedl et al. (1985) Cell 41:885); native Ig promoters, etc. 

In yet other embodiments of the invention, the active agent is an agent that modulates, 
and generally decreases or down regulates, the expression of the gene in the host. Antisense 
molecules can be used to down-regulate expression of the protein in cells. The anti-sense 
reagent may be antisense oligonucleotides (ODN), particularly synthetic ODN having 

30 chemical modifications fi*om native nucleic acids, or nucleic acid constructs that express such 
anti-sense molecules as RNA. The antisense sequence is complementary to the mRNA of the 
targeted gene, and inhibits expr ssion of the targeted gene products. Antisense molecules 
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inhibit gene expression through various mechanisms, e.g. by reducing the amount of mRNA 
available for translation, through activation of RNAse H, or steric hindrance. One or a 
combination of antisense molecules may be administered, where a combination may comprise 
multiple different sequences. 

Antisense molecules may be produced by expression of all or a part of the target gene 
sequence in an appropriate vector, where the transcriptional initiation is oriented such that an 
antisense strand is produced as an RNA molecule. Alternatively, the antisense molecule is a 
synthetic oligonucleotide. Antisense ohgonucleotides will generally be at least about 7, 
usually at least about 12, more usually at least about nucleotides in length, and not more 
than about 500, usually not more than about 50, more usually not more than about 35 
nucleotides in length, where the length is governed by efficiency of inhibition, specificity, 
including absence of cross-reactivity, and the hke. It has been found that short 
oligonucleotides, of from 7 to 8 bases in length, can be strong and selective inhibitors of gene 
expression (see Wagner et ai (1996), Nature Bioiechnol. 14:840-844). 

A specific region or regions of the endogenous sense strand mRNA sequence is 
chosen to be complemented by the antisense sequence. Selection of a specific sequence for 
the oligonucleotide may use an empirical method, where several candidate sequences are 
assayed for inhibition of expression of the target gene in an in vitro or animal model. A 
combination of sequences may also be used, where several regions of the mRNA sequence 
are selected for antisense complementation. 

Antisense oligonucleotides may be chemicaUy synthesized by methods known in the 
art (see Wagner et al. (1993), supra, and MilUgan et ai, supra.) Preferred oligonucleotides 
are chemically modified from the native phosphodiester structure, in order to increase their . 
intracellular stability and binding aflSnity. A number of such modifications have been 
described in the hterature, which alter the chemistry of the backbone, sugars or heterocyclic 
bases. 

Among usefid changes in the backbone chemistry are phosphorothioates; 
phosphorodithioates, where both of the non-bridging oxygens are substituted with sulfiir; 
phosphoroamidites; alkyl phosphotriesters and boranophosphates. Achiral phosphate 
derivatives inchide 3'.0'-5'-S-phosphorothioate, 3'-S-5'-0-phosphorothioate, 3'-CH2-5'-0. 
phosphonate and 3 '-NH-5 '-0-phosphoroamidate. Peptide nucleic acids replace the entire 
ribose phosphodiester backbone with a peptid linkage. Sugar modifications are also used to 
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enhance stability and affinity. The a-anomer of deoxyribose may be used, where the base is 
inverted with respect to the natural P-anomer The 2'-0H of the ribose sugar may be altered 
to form 2'-0-methyl or 2'-0"allyl sugars, which provides resistance to degradation without 
comprising affinity. Modification of the heterocyclic bases must mamtain proper base pairing. 
Some useful substitutions include deoxyuridine for deoxythymidine; 5-methyl-2'- 
deoxycytidine and 5-bromo-2'-deoxycytidine for deoxycytidine. 5- propynyl-2 '-deoxyuridine 
and 5-propynyl-2 '-deoxycytidine have been shown to increase affinity and biological activity 
when substituted for deoxythymidine and deoxycytidine, respectively. 

As an alternative to anti-sense inhibitors, catalytic nucleic acid compounds, e g, 
ribozymes, anti-sense conjugates, etc. may be used to mhibit gene expression, Ribozymes 
may be synthesized in vitro and administered to the patient, or may be encoded on an 
expression vector, fi-om which the ribozyme is synthesized in the targeted cell (for example, 
see International patent appUcation WO 9523225, and Beigelman et aL (1995), NucL Acids 
Res, 23:4434-42). Examples of oligonucleotides with catalytic activity are described in WO 
9506764. Conjugates of anti-sense ODN with a metal complex, e.g. terpyridylCu(II), capable 
of mediating mRNA hydrolysis are described in Bashkin et aL (1995), AppL Biochem. 
BiotechnoL 54:43-56. 

As mentioned above, an effective amount of the active agent is administered to the 
host, where "effective amount" means a dosage sufficient to produce a desired result, where 
the desired result in the desired modulation, e.g. enhancement, reduction, of PDE mteracting 
protein activity, which in turn leads to a desired effect on the state of the disease condition 
being treated, e.g. a reduction in the level of inflanmiation, etc. 

In the subjea methods, the active agent(s) may be administered to the host using any 
convenient means capable of resulting in the desired inhibition of PDE interacting protein 
activity. Thus, the agent can be incorporated into a variety of formulations for therapeutic 
admmistration. More particulariy, the agents of the present invention can be formulated into 
pharmaceutical compositions by combination with appropriate, pharmaceutically acceptable 
carriers or diluents, and may be formulated mto preparations in solid, semi-solid, liquid or 
gaseous forms, such as tablets, capsules, powders, granules, ointments, sohitions, 
suppositories, injections, inhalants and aerosols. 
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As such, administration of the agents can be achieved in various ways, including oral, 
buccal, rectal, parenteral, intr^eritoneal, intradermal, transdermal, intracheal,etc., 
administration. 

In pharmaceutical dosage forms, the agents may be administered in the form of their 
pharmaceutically acceptable salts, or they may also be used alone or in appropriate 
association, as well as in combination, with other pharmaceutically active compounds. The 
following methods and excipients are merely exemplary and are in no way limiting. 

For oral preparations, the agents can be used alone or in combination with appropriate 
additives to make tablets, powders, granules or capsules, for example, wi± conventional 
additives, such as lactose, mannitol, com starch or potato starch; with binders, such as 
crystalline cellulose, cellulose derivatives, acacia, com starch or gelatins; with disintegrators, 
such as com starch, potato starch or sodium carboxymethylcellulose; with lubricants, such as 
talc or magnesium stearate; and if desired, with diluents, buffering agents, moistening agents, 
preservatives and flavoring agents. 

The agents can be formulated into preparations for injection by dissolving, suspending 
or emulsifying them in an aqueous or nonaqueous solvent, such as vegetable or other similar 
oils, synthetic aliphatic acid glycerides, esters of higher aliphatic acids or propylene glycol; 
and if desired, with conventional additives such as solubilizers, isotonic agents, suspending 
agents, emulsifying agents, stabilizers and preservatives. 

The agents can be utilized in aerosol foraiulation to be administered via inhalatioa 
The compounds of the present invention can be formulated into pressurized acceptable 
propellants such as dichlorodifluoromethane, propane, nitrogen and the like. 

Furthermore, the agents can be made into suppositories by mixing with a variety of 
bases such as emulsifying bases or water-sohible bases. The compounds of the present 
invention can be administered rectally via a suppository. The suppository can include vehicles 
such as cocoa butter, carbowaxes and polyethylene glycols, which melt at body temperature, 
yet are solidified at room temperature. 

Unit dosage forais for oral or rectal administration such as syrups, elixirs, and 
suspensions may be provided wherein each dosage unit, for example, teaspoonfiil, 
tablespoonful, tablet or suppository, contains a predetermined amount of the composition 
containing one or more mhibitors. Similarly, unit dosage forms for injection or intravenous 
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administration may comprise the inhibitor(s) in a composition as a solution in sterile water, 
normal saline or another phannaceutically acceptable carrier. 

The term "unit dosage form," as used herein, refers to physically discrete units suitable 
as unitary dosages for human and animal subjects, each unit containing a predetennmed 
quantity of compounds of the present invention calculated in an amount suflBcient to produce 
the desired effect in association with a phannaceutically acceptable diluent, carrier or vehicle. 
The specifications for the novel unit dosage forms of the present mvention depend on the 
particular compound employed and the effect to be achieved, and the pharmacodynamics 
associated with each compound in the host. 

The phannaceutically acceptable excipients, such as vehicles, adjuvants, carriers or 
diluents, are readily available to the public. Moreover, pharmaceutically acceptable auxiliary 
substances, such as pH adjusting and buffering agents, tonicity adjusting agents, stabilizers, 
wetting agents and the like, are readily available to the public. 

Where the agent is a polypeptide, polynucleotide, analog or mimetic thereof, e.g. 
antisense composition, it may be introduced into tissues or host cells by any number of routes, 
inchidmg viral infection, microinjection, or fusion of vesicles. Jet injection may also be used 
for intramuscular administration, as described by Furth et al. (1992), AnalBiochem 205:365- 
368. The DNA may be coated onto gold microparticles, and delivered intradermally by a 
particle bombardment device, or "gene gun" as described in the literature (see, for example, 
Tang ei al (1 992), Nature 356: 152-1 54), where gold microprojectiles are coated witii the 
DNA, then bombarded into skin cells. 

Those of skill in die art will readily appreciate that dose levels can vary as a function 
of the specific compound, the severity of the symptoms and the susceptibility of tiie subject to 
side effects. Prefened dosages for a given conq)ound are readily determinable by those of skill 
in the art by a variety of means. 

The subject methods find use in the treatment of a variety of different disease 
conditions involving PDE interacting protein activity, particularly in those disease conditions 
in which the selective inhibition of PDE activity, more particularly PDEIV activity, results in 
treatment of the disease condition where targeting of the PDE interacting protein by the 
therapeutic agent results in modulated, e.g. reduced or enhanced, activity of its corresponding 
PDE. 
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Specific disease of interest as treatable by the subject methods include: asthma, 
including inflamed lung associate asthma, cystic fibrosis, inflammatory airway disease, chronic 
bronchitis, eosinophilic granuloma, psoriasis and other benign and malignant proliferative skin 
diseases, endotoxic shock, septic shock, ulcerative colitis, Crohn's disease, reperfiision injury, 
5 or the myocardium and brain, inflammatory arthritis, chronic gloerulonephritis, atopic 

dermatitis, urticaria, adult respiratory distress syndrome, diabetes insipidus, allergic rhinitis, 
allergic conjunctivitis, vernal conjunctivitis, arterial restinosis and artherosclerosis, 
infl a mm a t ory diseases associated with irritation and pain, rheumatoid arthritis, ankylosing 
spondylitis, transplant rejection and graft versus host disease, disease conditions associated 
10 with hypersecretion of gastric acid, disease conditions in which cytokines are mediators, e.g. 
sepsis, and septic shock, and the Uke. 

By treatment is meant at least an amelioration of the symptoms associated with the 
pathological condition aflElicting the host, where amelioration is used in a broad sense to refer 
to at least a reduction in the magnitude of a parameter, e.g. symptom, associated with the 
1 5 pathological condition being treated, such as inflanmiation, etc. As such, treatment also 
includes situations where the pathological condition, or at least symptoms associated 
therewith, are completely mhibited, e.g. prevented fi'om happening, or stopped, e.g 
terminated, such that the host no longer suffers firom the pathological condition, or at least 
the symptoms that characterize the pathological condition. 
20 A variety of hosts are treatable according to the subject methods. Generally such hosts 

are "mammals" or "mammalian," where these tenns are used broadly to describe organisms 
which are within the class mammalia, including the orders carnivore (e,g., dogs and cats), 
rodentia {e.g., mice, guinea pigs, and rats), and primates (e.g, humans, chimpanzees, and 
monkeys). In many embodiments, the hosts will be humans. 
25 Kits with unit doses of the active agent, usually in oral or injectable doses, are 

provided. In such kits, in addition to the containers contaimng the unit doses will be an 
informational package insert describing the use and attendant benefits of the drugs in treating 
pathological condition of interest. Preferred compounds and unit doses are those described 
herein above. 

30 

The following examples are offered primarily for purposes of illustration. It will be 
readily apparent to those skilled in the art that the formulations, dosages, methods of 
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administration, and other parameters of this invention may be further modified or substituted 
in various ways without departing firom the spirit and scope of the invention. 

EXPERIMENTAT. 
I. Screening of the yeast two hybrid system cDNA brain library 

To identify proteins that interact with a PDE4, cDNA coding for the amino terminus 
of PDE4D3 or for a region corresponding to a.a. 1 14-672 were inserted into pGBT9 vectors 
and used for screening of a Matchmaker rat brain library subcloned in pGAD 10 vector 
(Clontech, Palo Alto, CA), The fi-agment encoding the autoinhibitory (UCR2), catalytic, and 
carboxy terminal domains of rPDE4D3 (aa 1 14-672) was amplified by PGR with the full- 
length cDNA using the following forward and reverse primers with incorporated restriction 
sites and Stop codon, EcoRI: 5' CGG AAT TCG AGG AGG CCT ACC AGA AAC 3' 
(GUPA4) (SEQ ID NO:06) and SallATAG: 5* TGA GTC GAC TAG GTG TCA AGG CAA 
CAA TGG TC 3' (GUPA3) (SEQ ID NO:07). The PGR products were cloned into 
EcoRI/Sall site of pGBT9 (Clontech) downstream of the Gal4 activation domain. The PGR 
was performed in presence of recombinant Pfu polymerase (Stratagene) at low cycle number 
(10 cycles) to ensure high fidelity reading. The msertions were entirely sequenced to confirm 
the correct reading frame and the sequence. Sequencing was performed by the Molecular 
Biology facility at Stanford University using the ABI PRISM Dye Terminator Cycle 
Sequencing Ready Reaction Kit with AmpliTaq DNA Polymerase, FS (Perkin Ehner). 

Of the positive clones isolated from the screening of the rat brain library, 1 87 gave 
strong positive signal while 81 gave only a weak signal. Of the strong positive clones, PBP46 
was fiirther characterized. This clone contained an insert of aprroximately 2.8 kb. The 
interaction of the clone with the PDE was confirmed by subcloning the cDNA Augment in 
both pGBT9 and pGADlO and by testing growth and p-galactosidase activity in the yeast two 
hybrid system. The clone continued to show strong interaction with the 1 .6 fragment of 
PDE4D3. 

n. Screening for the fiill length myomegaiin clone 

A homology search (BLAST) using the sequence of PBP46 clone showed no 
significant identity to sequences in any public domain database. This clone was then used to 
probe a blot with RNA from multiple tissues. A transcript of approximately 8.0-8.5 kb 
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hybridized to the probe in several tissues, the highest level of expression being observed in the 
rat skeletal muscle and heart. Lower levels of expression were detected in brain, Uver and 
lung. In the testis a transcript of 2.0-2.4 kb was consistently observed. The expression in the 
testis was confirmed by PCR and by screening a rat testis Ubrary. Two clones containing the 
3* end sequence of myomegalin were retrieved fi-om this library. 

To obtain the complete sequence of the 8.0-8.5 transcript, a rat skeletal muscle cDNA 
Ubrary was screened with the PBP46 cDNA. From this screening, 2 clones were retrieved. 
However, the clones did not yield a complete ORF. Screening was then repeated six more 
times with oligonucleotides corresponding to the 5' end of the longest clones. From this 
multiple screening, 21 overiapping clones were obtained. Merging of the sequences from the 
different clones yielded a 9 kb sequence, a size in agreement with the size of the transcript 
derived fi-om rat heart and skeletal muscles. See Fig. 2. Conceptual translation of the 
nucleotide sequence uncovered an open reading fi-ame of a protein of 2324 amino acids 
corresponding to a calculated MW of 26 1 kDa, See Fig. 1 . 

To analyze tissue distribution of the rat myomegalin transcripts. Northern blot analysis 
was performed using radioactively labeled probes corresponding to the 3' end (probe 1; 1000 
bp) and the 5' end (probe 2; 665 bp) of the myomegalin open reading fi-ame. Transcripts of 
various sizes were found in various tissues using either probe I or probe 2 or both. The 
results indicated that there are at least four different transcripts of rat myomegalin: two 
expressed in heart (7.5 and 5.9 kb); two in skeletal muscle (7.5 and 4.3 kb) and one in testis 
(2.5 kb). The 2.5 kb variant roughly corresponds to the PBP46 clone, and is expressed 
exclusively in rat testis. 

in. Screening of the EST/database 

To determine whether mouse or human sequences analogous to the rat myomegalm 
are present in public domam databases, the rat sequence was used for a BLAST search of 
GenBank and EST libraries. The foUovwng EST were retrieved. AA755885, AA110441, 
W23471, AA333456, AA489265. These sequences are more than 90% homologous to the rat 
sequence. Sequence AL021920 contains a genomic fragment from human chromosome 
Ip35.1-p36.21. Several exons overiap with the rat sequence from residue 1215 until residue 
1444. Thus myomegalin must reside on human chromosome lp35. l-p36. KIAA0454 
(accession # AB007923), KIAA0477 (acccesion # AB007946) are two clones containing 
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portion of the human myomegalin sequence since they are more than 90% homologous to the 
rat ORF. These human clones were merged to obtain a full length human sequence 
homologous to myomegalin. See Fig. 4. The human open reading frame coded for a protein 
of 2517 residues and a calculated molecular weight of 282. 1 kDa. See Fig. 5. 
5 Alignment of the human and rat sequence showed identity from aa 235 of rat 

myomegalin to the end. In the amino terminus region, the two sequences showed only weak 
homologies. The reason for this discrepancy is at present unclear. It is possible that it is due 
to species differences. The junction where the rat sequence diverges from the human was 
derived from four clones isolated from the rat skeletal muscle library, lessening the possibility 
10 that cloning artifact is at the basis of this discrepancy. The presence of the jimction was 
further confirmed by PGR analysis of rat heart mRNA (data not shown). However, further 
blast searches with the region encompassing the 5* end of myomegalin did not yield mouse 
EST fragments overlapping the junction. Conversely, several EST clones confirming the 
human junction were retrieved from human and mouse EST databases, 

15 

IV. Protein/protein interaction 

Several attempts were made to confirm the interaction between myomegalin and 
PDE4D3. However, due to the insolubility of the full length or truncated myomegalin 
immunoprecipitation experiments could not be performed. In an alternative approach, PBP46 
20 was cotransfected with PDE4D3 in COS 7 cells and the PDE activity was determined m the 
particulate fraction of the cell. If PDE4D3 interacts with PBP46, an increase m the particulate 
PDE activity would be expected. Two to three fold increase in the particulate PDE4D3 
activity was deteaed when plasmids containing PBP46 and PDE4D3 were cotransfected in 
C0S7 cells. 

25 

V. Subcellular localization of myomegalin 

To investigate the subcellular localization of myomegalin the PBP46 clone was 
subcloned in frame to a flag tag and e^qjressed in C0S7 cells. The recombinant protein thus 
obtained was entirely recovered in the particulatefraction and could be extracted only with 
30 buffer containing SDS. Expression in transfected cells was further assessed by 

unmunofluorescence (IF) using the flag antibody. The flag tagged recombmant protein 
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encoded in PBP46 was entirely localized in the Golgi/centrosomal region of C0S7 cells. No 
attempts were made to express the fiill-length rayomegalin cDNA. 

VI. Western blot analysis of muscle and testis extracts 

Polyclonal antibodies were raised in rabbit against peptides corresponding to the 
carboxyl terminus region of myomegalin. These antibodies recognize in testis a protein of 
approximately 64 kDa. In heart and muscle, proteins of 280,250 and 200 kDa were observed. 
It is at present unknown whether these are native proteins or products of proteolysis. When 
these antibodies were used for IF locaUzatioa a region corresponding to the 
Golgi/centrosomal region is intensely labeled. 

It is apparent from the above results and discussion that polynucleotides encoding 
novel mamma l i an PDE interacting proteins, such as myomegalin, as well as the novel 
polypeptides encoded thereby, are provided. The subject invention is important for both 
research and therapeutic appUcations. For example, identification of the subject PDE 
interacting proteins provides for the ability to screen potential PDE inhibitors with PDE/PDE 
interacting protein complexes, where the results of such screening procedures should be more 
indicative of in vivo activity of a potential agent than screening procedures in which PDE is 
used by itself Accordingly, the subject invention provides for a significant contribution to the 
art. 

All publications and patent appUcations cited in this specification are herein 
incorporated by reference as if each individual publication or patent ^plication were 
specifically and individually mdicated to be incorporated by reference. The citation of any 
publication is for its disclosure prior to the filing date and should not be construed as an 
admission that the present invention is not entitled to antedate such publication by virtue of 
prior inventioa 

Although the foregoing invention has been described in some detail by way of 
illustration and example for purposes of clarity of understanding, it is readily apparent to 
those of ordinary skill in the art in Ught of the teachings of this invention that certain changes 
and modifications may be made thereto without departing from the spirit or scope of the 
appended claims. 
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WHATTSCLATMFn 

1 . A polynucleotide present in other than its natural environment encoding a PDE 
interacting polypeptide. 

2. The polynucleotide according to Claim 1, wherein said polynucleotide encodes 
a myomegalin protein. 

3 . A fragment of a polynucleotide according to Claim 1 . 

4. An PDE interacting polypeptide present in other than its naturally occurring 
environment. 

5. The polypeptide according to Clahn 4, wherein said polypeptide is a 
myomegalin protein. 

6. A fragment of a polypeptide according to Claim 4. 

7. Substantially pure PDE interacting protein. 

8. Isolated PDE interacting protein. 

9. An expression cassette comprising a transcriptional initiation region functional 
in an expression host, a polynucleotide having a nucleotide sequence found in the nucleic acid 
according to Claim 1 under the transcriptional regulation of said transcriptional initiation 
region, and a transcriptional termination region functional in said expression host. 

10. A cell comprising an expression cassette according to Claim 9 as part of an 
extrachromosomal element or integrated into the genome of a host cell as a result of 
introduction of said expression cassette into said host cell. 

1 1 . The cellular progeny of the cell according to Claim 10. 
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12. A method of producing an PDE interacting polypeptide, said method 
comprising: 

growing a cell according to Claim 10, whereby said polypeptide is expressed; and 
isolating said polypeptide substantially free of other proteins. 

5 

13. A monoclonal antibody binding specifically to a PDE interacting protein. 

14. The monoclonal antibody according to Claim 13, wherein said antibody 
inhibits the activity of at k ast one of PDE or a PDE interacting protein. 

10 

15. The monoclonal antibody according to Claim 13, wherein said antibody is a 
humanized antibody. 

16. A method of determining whether an agent modulates the activity of a PDE, 
1 S said method comprising: 

contacting a complex of said PDE and a PDE interacting protein with said agent; and 
determining the effect of said agent on the activity of said PDE. 

17. The method according to Claim 16, wherem said agent is a small molecule. 

20 

18. The method according to Claim 1 6, wherein said agent is an antibody. 

19. The method according to Claim 1 8, wherein said agent is a monoclonal 
antibody. 

25 

20. A method for modulating the activity of a PDE interactmg protein, said 
method comprising: 

contacting said PDE interacting protein with an agent that modulates the activity of 
said PDE interacting protein. 

30 
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1/12 



FIG. 1 



>myoraegalin protein 

MSNGYRTLSQHLNDLKKENFSLKLRIYFLEERMQQKYEVSREDVYKRNIELKVEVESLKRELQDRKQHL 

HKTWADEEDLNSQNEAELRRQVEEPQQETEHVYELLDNNIQLLQEESRFAKDEATQMETLVEAEKGCNL 

ELSERWKDATKNREDAPGDQVKLDQYSAALAQRDRRIEELRQSLAAQEGLVEQLSREKQQLLHLLEEPG 

GMEVQPMPKGLPTQQKPDLNETPTTQPSVSDSHLAELQDKIQQTEVTNKILQEKLNDMSCELRSAQESS 

QKQDTTIQSLKEMLKSRESETEELYQVIEGQNDTMAKLPEMLHQSQLGQLQSSEGIAPAQQQVALLDLQ 

SALFCSQLEIQKLQRLLRQKERQLADGKRCMQFVEAAAQEREQQKEAAWKHNQELRKALQHLQGELHSK 

SQQLHVLEAEKYNEIRTQGQNIQHLSHSLSHKEQLIQELQELLQYRDTTDKTLDTNEVFLEKLRQRIQD 

RAVALERVIDEKFSALEEKDKELRQLRLAVRDRDHDLERLRCVLSANEATMQSMESLLRARGLEVEQLI 

ATCQNLQWLKEELETKFGHWQKEQESIIQQLQTSLHDRNKEVEDLSATLLHKLGPGQSEVAEELCQRLQ 

RKERVLQDLLSDRNKQAMEHEMEVQGLLQSMGTREQERQAVAEKMVQAFMERNSELQALRQYLGGKELM 

AASQAFISNQPAGATSVGPHHGEQTDQGSTQMPSRDDSTSLTAREEASIPRSTLGDSDTVAGLEKELSN 

AKEELELMAKKERESQIELSALQSMMAVQEEELQVQAADLESLTRNIQIKEDLIKDLQMQLVDPEDMPA 

MERLTQE VLLLRE KVAS VE PQGQEGS ENRRQQLLLMLEGLVDERSRLNEALQAERQL YS S L VKFHAQPE 

ISERDRTLQVELEGAQVLRSRLEEVLGRSLERLSRLETLAAIGGATAGDETEDTSTQFTDSIEEEAAHN 

SHQQLIKVSLEKSLTTMETQNTCLQPPSPVGEDGNRHLQEEMLHLRAEIHQPLEEKRKAEAELKELKAO 

lEEAGFSSVSHIRNTMLSLCLCLENAELKEQMGEAMSDGWEVEEDKEKGEVMVETWAKGGLSEDSLQA 

EFRKVQGRLKSAYNIINLLKEQLVLRSSEGNTKEMPEFLVRLAREVDRMNMGLPSSEKHQHQEQENMTA 

RPGPRPQSLEaGTALSVDGYQLENKSQAQDSGHQPEFSLPGSTKHLRSQLAQCRQRYQDLQEKLLISEA 

TVFAQANQLEKYRAILSESLVKQDSKQIQVDLQDLGYETCGRSENEAEREETTSPECEEHGNLKPWLV 

EGLCSEQGYLDPVLVSSPVKNPWRTSQEARRIQAQGTSDNSSLLRKDIRNLKAQLPNAYKVLQNLRSRV 

RSLSATSDYSSSLERPRKLIAVATLEGASPHSVTDEDEGLLSDGTGAFYPPGLQAKKNLENLIQRVSQL 

EAQLPKTGLEGKLAEELKSASWPGKYDSLIQDQARKTVISASENTKREKDLFSSHPTFERYVKSFEDLL 

RNNDLTTYLGQSFREQLSSRRSVTDRLTSKFSTKDHKSEKEEVGLEPLAFRFSRELQEKEKVIEVLQAK 

VDTRFFSPPSSHAASESHRCASSTSFLSDDIEACSDMDVASEYTHYEEKKPSPSNSAASASQGLKGEPR 

SSSISLPTPQKPPKEASQAQPGFHFNSIPKPASLSQAPMHFTVPSFMPFGPSGPPLLGCCETPWSLAE 

AQQELQMLQKQLGRSVSIAPPTSTSTLLSNHTEASSPRYSNPAQPHSPARGTIELGRILEPGYLGSGQW 

DtM^PQKGSISGELSSGSSMYQLNSKPTGADLLEEHLGEIRNLRQRLEESICVNDRLREQLQHRLSSTA 

RENGSTSHFYSQGLESMPQLYNENRALREENQSLQTRLSHASRGHSQEVDHLREALLSSSSQLQELEKE 

LEQQKAERRQLLEDLQEKQDEIVHFREERLSLQENNSRLQHKUaLQQQdEEKQQLSLSLQSELQIYES 

LYENPKKGLKAFSLDSCYQVPGELSCLVAEIRALRVQLEQSIQVNNRLRLQLEQQMDHGAGKASLSSCP 

VNQSFSAKAELANQQPPFQGSAASPPVRDVGLNSPPWLPSNSCSVPGSDSAIISRTNNGSDESAATKT 

PPKMEVDAADGPFASGHGRHVIGHVDDYDALQQQIGEGKLLIQKILSLTRPARSVPALDAQGTEAPGTK 

SVHELRSSARALNHSLEESASLLTMFWRAALPNSHGSVLVGEEGNI^MEKELLDLRAQVSQQQQLLQSTA 

VRLKTANQRKKSMEQFIVSHLTRTHDVLKKARTNLEMKSFRALMCTPAL (SEQ ID NO: 01) 
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FIG. 2 >MYOMEGALIN complete DNA 
CCGGTCCCCTTTGGTAGTAGTATCTCAGAGCTCGCCCCATAGTTTCATAGTTCATGTCTGGTTTGTTCT 
TATGCTTTCCCCAGAGCTTCGAGACAGCCTTTGAGTCCACCAGCTTGAATATGCCCTTTTCTCTCTGAG 
TCCATTTAATATACCTGGGACAAGTATTTTTATCTTGAAGCAGATCTAAAAGAAACTCCCACAGATAGG 
TTGTGTTTCCTTTTCCTTCTCTGGCTTTCTTCTTGACTCCTAACTCAGGAGACCCATTGGAAACTGGTG 
ACTGCTGGGTCTTTGGTTTACGGCCAACTTTCTTCTTTTTCATTGGTTCGTGGCTGTCTGGTGAAGTAT 
GGATAGGCGAGGCATCCATTGGTTCAGACTCCTCTGTTGACACCTCCACTACAG7CTCCGTAATGACAT 
CTGGCCTCATCGCAGCATGGATAAAATCGGATTCTTGAATCCTCAAGCAGGTAGGAGACTCCATATGAA 
GCAGGGCTTCAGCAGCTTCAATGGTCTTATCCGTACAGTGTGCATTACTGCTGTGAACTGATGCTTCCA 
CGGCCCGGATGAGCAGATCCAGCTGGTTCGTGGGTCCCTCATGCAGAGACGTCGCCATGTTTATCCCGG 
GGCTGGAACTGCTGCTTCACATTGACTTACACCCTGAGCAGCGGCGACAGGGGAGAAGGCGGAACCCGC 
GGCCGGAGACACACGCCGTGCGGGCGGCACACACTCACGCACTCGCACACACTCCGACGCCCGGATCCT 
TGCGCGTCCTCCGACAGGAAGCGGCGGCCGGCCCGCCTCCCGCCGCCGGGCTGAGCAGCCCCACCACCT 
AACGGCAGGGGCGGCGGCGGCCCCGGCTGGCAACGCGATCCTTCCGCCCCGCGCCCAGACAGGAAGTCC 
CGGGCGCCGGCAGCCAGCGGCCGCACGGACACCTGAGGCTGGGGAGCCCGCAGGCCGCCCTCGGGGACG 
CGGGCCTCGGCAGGAAAAGGCGCGCTTCACGTTCTGCGGAAGCGAAGTCTGCAAATGTCCCCTCAGCAT 
GGTCTTCCTCCTGGCTCAATCTGTCTCACCTTCAGGTGATCCTAGGACTGGGGCTCCTTTCCAGGTCCC 
CAGTTTCTCAAGTCGATCTTCTACCTCCCTCTTGA7TTTCTACTCCATTGCTGGAAAGCTCCAGAACAG 
AGCCTCCGCCGCCAACCACTGCTGATGCCATCGCGTCTTCCCTGAGCAAGTTTCGAACGCTGCGAATCA 
ATGTAATTACGGCTCAGATGATTGCCAGGGTTATCGGTTTCATGTTCTAATTCAATAGTGATGGAGTAG 
ACATCCAGAAGTCCAGTCTTCTAAAGATGATTAACCAGAGGGTAGTTTGACGGTTAAGTAGTCTAAGCA 
TCCTTCACCGTTTCCACACTCCCAAGAGCTGAACTCTAAACCAGCAGCTCTCTGGAGCTACTGCTCTCC 
CTCCACGTCGCCGTGTCCCTTGCCCTTCCCCTCAGGGCCGCAGACCGGCCGAGCCGCCGCAGCCGCCCG 
CCGTTGGCCCGGCGTCCTGCGGGAAGCCGAGGGGGCCTCCCCGGGCCACCGCGCGAGCCGCTCCGCACC 
ACAGGACGAGACAAACCGCGGCTATGTCGCCTTAGCCCTCGGGGTCCCACAGCCTCAGCAGCGTCCTAG 
CCTGCCCGCTCCATGCCACGGCAAGGCTGCACCGTGTTCCAGGGGTGAAGGGGGCGATCGGGCATGCTC 
CTCCCCATGGGTCGCCCACCATGTCTAATGGATATCGCACTCTGTCCCAGCACCTCAATGACCTGAAGA 
AGGAGAACTTCAGCCTCAAGCTGCGCATCTACTTCCTGGAGGAGCGCATGCAACAGAAGTATGAAGTCA 
GCCGGGAGGACGTCTACAAGCGGAACATTGAGCTGAAGGTTGAAGTGGAGAGCCTGAAACGAGAGCTCC 
AGGACAGGAAACAGCATCTACATAAAACATGGGCCGATGAGGAGGATCTCAACAGCCAGAATGAAGCAG 
AGCTCCGGCGCCAGGTTGAAGAACCGCAGCAGGAGACAGAACACGTTTATGAGCTCCTAGACAACAACA 
TTCAGCTGCTGCAGGAGGAATCCAGGTTTGCAAAGGATGAAGCCACACAGATGGAGACTCTGGTGGAGG 
CAGAGAAGGGGTGTAATCTGGAGCTCTCAGAGAGGTGGAAGGATGCTACCAAGAACAGGGAAGATGCAC 
CGGGAGACCAGGTGAAGCTTGACCAATATTCTGCGGCACTGGCTCAGAGGGACAGGAGAATTGAAGAGC 
TGAGGCAGAGCTTGGCTGCCCAGGAGGGGCTTGTGGAACAGCTGTCTCGAGAGAAACAACAACTGTTAC 
ATCTGCTGGAGGAGCCTGGGGGCATGGAAGTGCAGCCCATGCCTAAAGGGTTACCCACGCAACAAAAGC 
CAGACCTAAATGAGACCCCTACAACCCAGCCATCTGTGTCTGATTCCCACCTGGCAGAACTCCAGGACA 
AAATCCAGCAAACAGAGGTCACCAACAAGATTCTTCAAGAGAAACTGAATGACATGAGCTGTGAGCTCA 
GATCTGCACAGGAGTCGTCTCAGAAGCAAGATACGACAATCCAAAGCCTCAAGGAAATGCTAAAGAGCA 
GGGAAAGTGAGACTGAAGAGCTGTACCAGGTGATTGAAGGTCAAAATGACACAATGGCAAAGCTTCCGG 
AAATGCTACACCAGAGCCAGCTCGGACAGCTCCAGAGCTCAGAGGGCATTGCCCCTGCTCAGCAGCAAG 
TGGCCCTGCTTGACCTTCAGAGTGCTCTGTTCTGCAGCCAGCTTGAAATCCAGAAGCTCCAGAGGCTGT 
TACGCCAGAAAGAGCGTCAGCTGGCTGACGGCAAGCGGTGCATGCAATTTGTGGAGGCTGCAGCACAGG 
AGAGAGAGCAGCAGAAGGAAGCTGCTTGGAAACATAACCAGGAATTACGAAAAGCTTTGCAACACCTCC 
AAGGAGAACTGCACAGTAAGAGCCAACAGCTCCACGTTCTGGAGGCAGAAAAATATAATGAAATTCGAA 
CCCAGGGACAAAACATTCAACACCTAAGTCACAGTCTGAGTCACAAAGAGCAGCTAATTCAGGAAC^ 
AGGAGCTCCTACAGTATCGGGATACCACAGACAAAACTCTAGACACAAATGAGGTGTTTCTTGAGAAAC 
TACGGCAACGAATACAAGACCGGGCAGTTGCTCTAGAGCGGGTTATAGATGAAAAGTTCTCTGCTCTAG 
AAGAAAAGGACAAGGAACTGCGGCAGCTCCGGCTTGCTGTGAGGGACCGAGACCATGACTTAGAGAGAC 
TGCGTTGTGTCCTGTCTGCCAATGAAGCTACCATGCAAAGTATGGAGAGTCTCCTGAGGGCCAGAGGCC 
TGGAAGTGGAGCAGTTAATTGCCACCTGCCAAAACCTCCAGTGGTTGAAGGAAGAATTGGAAACCAAGT 
TTGGCCACTGGCAGAAGGAACAGGAGAGCATCATTCAGCAGTTACAGACATCTCTGCATGACAGGAACA 
AAGAAGTAGAGGATCTCAGTGCAACTTTGCTCCACAAACTTGGACCCGGCCAGAGTGAAGTAGCTGAGG 
AGCTGTGCCAGCGCCTGCAGCGGAAGGAAAGGGTGCTGCAGGACCTTCTGAGTGATCGGAACAAACAAG 
CCATGGAGCACGAGATGGAGGTCCAGGGACTGCTCCAGTCGATGGGCACCCGGGAACAGGAAAGACAGG 
CTGTTGCAGAAAAAATGGTACAAGCCTTCATGGAAAGAAACTCGGAATTACAGGCCCTGCGGCAGTATC 
TAGGGGGGAAGGAATTAATGGCAGCATCTCAGGCATTCATCTCTAACCAACCAGCTGGAGCGACTTCTG 
TAGGCCCCCACCATGGAGAGCAAACTGACCAAGGTTCTACGCAGATGCCCTCTCGAGACGACAGCACCT 
CGCTGACTGCCAGAGAGGAGGCCAGCATACCCCGGTCTACATTAGGAGACTCAGACACAGTTGCAGGGC 
TGGAGAAAGAACTGAGCAATGCCAAGGAGGAGCTTGAGCTCATGGCCAAAAAAGAAAGAGAAAGCCAGA 
TAGAATTGTCTGCCCTGCAGTCCATGATGGCTGTGCAAGAGGAAGAGCTGCAGGTGCAGGCTGCTGACT 
TGGAGTCCCTGACCAGGAACATACAGATAAAAGAAGACCTCATA^CGACCTGCAAATGCAACTGCTTG 



wo 00/2786] 



PCT/US99/26860 



3/12 
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ACCCTGAAGATATGCCAGCCATGGAGCGCCTGACCCAAGAGGTCTTACTTCTTCGGGAAAAAGTTGCTT 

CAGTGGAACCCCAGGGTCAGGAAGGGTCAGAGAACAGGAGAC.iVACAGTTGCTGCTGATGTTAGAAGGAC 

TAGTGGATGAACGGAGTCGGCTCAACGAGGCCCTGCAAGCTGAGCGGCAGCTCTACAGCAGCCTGGTCA 

AGTTCCATGCCCAACCAGAGATCTCTGAGAGAGACCGAACTCTGCAGGTGGAACTGGAAGGGGCCCAGG 

TGTTACGCAGTCGACTAGAAGAAGTTCTTGGAAGAAGCCTGGAGCGCTTAAGCAGGCTGGAGACCCTGG 

CCGCCATTGGAGGTGCTACTGCAGGCGATGAGACTGAAGATACAAGCACACAGTTCACAGACAGCATTG 

AGGAGGAGGCTGCACACAACAGCCACCAGCAACTCATCAAGGTGTCTTTGGAGAAAAGCCTGACCACCA 

TGGAGACCCAGAACACATGTCTTCAGCCCCCTTCCCCAGTAGGAGAGGATGGTAACAGGCATCTTCAGG 

AAGAAATGCTCCACCTGAGGGCTGAAATCCACCAGCCCTTAGAAGAGAAGAGAAAAGCTGAGGCAGAAC 

TCAAGGAGCTAAAGGCTCAAATTGAGGAAGCAGGATTCTCCTCTGTGTCCCACATCAGGAACACCATGC 

TGAGCCTTTGCCTTTGCCTTGAGAATGCAGAGCTGAAAGAGCAGATGGGAGAAGCAATGTCTGATGGAT 

GGGAGGTGGAGGAAGACAAGGAGAAGGGCGAGGTGATGGTGGAGACCGTGGTGGCCAAAGGGGGTCTGA 

GTGAGGACAGCCTTCAGGCTGAGTTCAGO^AAGTCCAGGGGAGACTCAAGAGTGCCTACAACATCATCA 

ACCTCCTCAAAGAGCAGCTGGTCCTGAGAAGCTCGGAAGGGAACACTAAGGAGATGCCAGAGTTCCTCG 

TGCGCCTGGCCAGGGAGGTGGACAGAATGAACATGGGCTTGCCTTCCTCGGAGAAGCATCAACACCAAG 

AACAGGAGAATATGACCGCAAGGCCTGGCCCCAGGCCCCAGAGTCTCAAGCTTGGGACAGCTCTCTCAG 

TAGACGGCTACCAACTGGAGAACAAGTCCCAGGCCCAAGACTCTGGACATCAGCCAGAATTTAGCCTAC 

CAGGGTCCACCAAACACCTGCGCTCCCAGCTGGCTCAGTGTAGACAACGGTACCAAGATCTCCAGGAGA 

AGCTGCTCATCTCAGAAGCCACTGTGTTTGCCCAGGCAAACCAGCTAGAGAAGTACAGAGCCATATTAA 

GTGAATCCCTGGTGAAGCAGGACAGCAAGCAGATCCAGGTGGACCTTCAGGACCTGGGCTATGAGACTT 

GTGGCCGAAGTGAGAATGAAGCTGAACGTGAGGAGACCACCAGCCCTGAGTGTGAGGAGCACGGTAACC 

TGAAGCCTGTGGTGCTGGTGGAAGGCTTGTGCTCTGAGCAAGGGTACCTGGACCCTGTCTTGGTCAGCT 

CACCTGTGAAGAACCCTTGGAGAACAAGCCAGGAAGCCAGAAGAATCCAGGCACAAGGAACTTCAGACA 

ACAGCTCTCTCCTGAGGAAGGACATCCGAAATCTGAAAGCCCAGCTACCGAATGCCTACAAGGTCCTTC 

AGAACCTGAGGAGCCGGGTCCGGTCCCTGTCTGCCACAAGCGATTACTCATCGAGTCTGGAGAGACCCC 

GCAAGCTGATAGCCGTGGCAACCCTTGAGGGGGCCTCACCCCACAGTGTCACTGATGAAGACGAAGGCT 

TGTTGTCAGATGGCACCGGGGCTTTTTACCCTCCAGGGCTCCAGGCCAAAAAGAATCTAGAGAATCTCA 

TCCAGAGAGTATCCCAGCTGGAGGCCCAGCTCCCCAAAACTGGACTAGAAGGGAAGCTGGCTGAAGAAC 

TGAAGTCCGCCTCGTGGCCTGGAAAATACGATTCTTTGATTCAGGATCAGGCCCGAAAAACTGTCATAT 

CTGCGTCCGAAAATACXAAAAGGGAGAAGGATTTGTTTTCTTCTCACCCAACATTCGAAAGATACGTCA 

AATCTTTTGAAGACCTCCTGAGGAACAACGACTTGACTACTTACCTGGGCCAGAGCTTCCGGGAACAAC 

TTAGTTCAAGGCGTTCAGTGACAGACAGGCTGACCAGCAAATTCAGCACAAAGGATCATAAGAGTGAAA 

AAGAAGAAGTTGGGCTTGAGCCACTGGCCTTCAGGTTCAGCAGGGAATTACAGGAGAAAGAGAAAGTGA 

TTGAAGTCCTGCAGGCCAAGGTGGATACCCGGTTTTTCTCACCCCCCAGCAGCCATGCTGCGTCTGAGT 

CCCACCGTTGTGCCAGCAGCACATCTTTCCTGTCGGATGACATAGAAGCCTGCTCTGACATGGACGTAG 

CCAGCGAGTACACACACTATGAAGAGAAGAAGCCCTCACCCAGTAACTCAGCAGCCAGTGCATCTCAGG 

GGCTTAAGGGCGAGCCCAGAAGCAGCTCCATCAGCTTGCCAACTCCCCAGAACCCCCCTAAGGAGGCCA 

GCCAGGCCCAGCCAGGCTTTCACTTTAACTCCATACCCAAGCCGGCTAGCCTTTCCCAGGCACCAATGC 

ACTTCACTGTACCCAGCTTCATGCCTTTCGGCCCCTCTGGGCCTCCCCTTCTTGGTTGCTGTGAGACAC 

CAGTGGTGTCCTTGGCTGAGGCTCAACAAGAGCTGCAGATGCTGCAGAAGCAGCTGGGACGAAGTGTTA 

GCATTGCCCCTCCCACCTCCA»TCCACGTTGCTTAGCAACCACACAGAAGCTAGCTCTCCCCGCTACA 

GCAACCCTGCTCAGCCCCACTCCCCAGCAAGGGGCACCATAGAGCTGGGCAGAATCCTGGAGCCTGGAT 

ACCTGGGCAGCGGCCAGTGGGAO^TGATGAGGCCTCAGAAAGGGAGCATCTCTGGGGAGCTGTCCTCAG 

GCTCCTCGATGTACCAGCTTAACTCCAAACCCACAGGGGCCGACCTGTTGGAAGAGCATTTAGGTGAGA 

TCCGGAACCTGCGCCAGCGCCTGGAGGAGTCCATATGTGTCAATGACAGGCTACGGGAGCAGCTGCAGC 

ATAGGCTCAGCTCCACGGCCCGAGAAAATGGTTCCACCTCTCACTTCTACAGTCAGGGCCTGGAGTCCA 

TGCCTCAGCTCTACAATGAGAACAGAGCCCTCAGGGAAGAAAACCAAAGCCTGCAGACACGGCTCAGTC 

ATGCTTCCAGGGGACACTCCCAGGAAGTGGACCACCTGAGGGAGGCTCTGCTTTCCTCAAGTTCCCAGC 

TCCAGGAGCTGGAGAAGGAGCTGGAGCAGCAGAAGGCTGAGCGGCGGCAGCTTCTGGAAGACTTGCAGG 

AGAAGCAGGATGAGATCGTGCATTTCCGAGAGGAGAGGCTGTCCCTCCAGGAAAACAACTCCAGGCTGC 

AGCACAAGCTGGCCCTCCTGCAACAACAGTGTGAGGAGAAACAGCAGCTCTCCCTGTCCCTGCAGTCAG 

AGCTCCAGATCTACGAGTCCCTC7ACGAAAATCCTAAGAAGGGCTTGAAAGCCTTCAGCCTAGATTCCT 

GTTACCAAGTCCCGGGTGAGTTGAGCTGCCTCJGTGGCAGAGATTCGAGCTCTGAGAGTGCAGTTGGAGC 

agagcattcaagtgaacaaccgtctgcggctgcagctggaacagcagatggatcacggtgctggcaaag 
ccagtctcagttcctgccctgttaaccagagcttctcagccaaggcggagctggcaaaccagcagccac 
ccttccaaggttcagctgcttcccctccagtccgggacgttggcttgaattctccacccgtggtcctcc 
ccagcaattcgtgctctgttcctggctcagactctgccatcatcagtaggacaaacaatggttcggatg 
agtctgcagcaacgaagacccctcccaagatggaggtcgatgctgctgatggcccatttgccagtggac 
acggcagacacgtcatcggccatgtggatgactacgacgccctacagcagcagattggggaagggaagc 
tgctgatcca;ij^agatactgtctctcacgaggccagcacgcagcgtccctgcactcgacgcgcagggca 
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FIG J (CONT) 

CAGAGGCACCAGGTACCAAAAGTGTCCATGAGCTTCGGAGCAGCGCCAGGGCTCTGAACCACAGCCTAG 
AAGAGTCAGCTTCCCTCCTCACCATGTTCTGGAGAGCAGCTTTGCCAAACTCTCATGGTTCTGTACTGG 
TAGGCGAAGAGGGAAACCTGATGGAGAAAGAACTCCTAGACCTGCGAGCCCAAGTGTCCCAACAGCAAC 
AGCTCCTTCAGAGCACTGCTGTGCGTCTGAAGACGGCCAACCAGAGGAAGAAAAGCATGGAGCAGTTCA 
TCGTGAGCCATCTGACCAGGACCCATGATGTCTTGAAGAAAGCACGGACTAATTTAGAGATGAAATCCT 
TCAGGGCCCTGATGTGCACTCCAGCCTTGTGACCCTTGCCTTCCAGGAGCCACATAAAAGGCGAAGCCA 
GGAGTCCTTAAAACAGCAGGAAGGGTGGGCCTGCCCGCCCCTAGTACAGCTGCCTGTCTGCTGAGGAAT 
ACCTGGTCCGACTCCTCCCCTGCTGGAGCTCCAGGGAAGGGCTCATATATGTGTCCACATGGGACAGGC 
AGGAAGGAAAGTGGCATCCTGACAATGAATATGATTAGCCAAGGCCCACTGGGCCCATCACTAAGCAAA 
ACTCATGTAGACTGTGTAGAAGGCCCCCCGGCACTGCTTCTAGACAGCCTCAGCAGCACGGTGCCCACC 
TCGTTACAGTTCTCACCTCAAGATAGCCAACTCAGGGGAACTAGGACCTTACCACCCACAAACAGGATG 
TGTGGTCCCAATGCCAACGCTCCTCAGACAGTTGTAAAAGCACACATCATTGAGTGGCAGCGTCCAGCC 
GGACACTGTTGGAGACTACCAAACCCCTCACTGACCCAGTCTTGGGCCAGGCCAGCTCTGTGGGCCAAG 
TCTGGTAGTACTTTGGTCTCTACCACCACACCAGAGAGAGTCTATATAGCAAATGTGGTAACTTGTAGG 
TGCCCTGCACTTAGCCTAGCACCTTCTGTTTCTTACGTGATCTCAAGTTGAACCAACTTCCTTAACTCT 
GCTGTCCCCTGAATCCTAACTTCCCTCAGGGGAATTGGAGATTGGTGGCCACATCATGCCTATTGAATG 
TTTAGTGAACAGCATATCGGTGCCTCTTAATGGCATGGGCAAGGCCTGCTCTGTACTGAAGACTGTGTC 
TTCACAGTGCTCATAGGACGTGGGTGTGTGTATAAATGTATAATATAGATTTATATATGTCGCTATGGC 
TATGTGTTGAAGGCCAGCATAAGTGCAGAGCGATGGGTGAGAAGACGCTAAGCAGTCTTTCTTATGGCT 
ATTAAAGCTAACTGTGTAC (SEQ ID NO: 02) 
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MYOMEGALIN firstMET until stop 

ATGTCTWGGATATCGCACTCTGTCCCAGCACCTCAATGACCTGAAGAAGGAGAACTTCAGCCTCAAG 

CTGCGCATCTACTTCCTGGAGGAGCGCATGCAACAGAAGTATGAAGTCAGCCGGGAGGACGTCTACAAG 

CGGAACATTGAGCTGAAGGTTGAAGTGGAGAGCCTGAAACGAGAGCTCCAGGACAGGAAACAGCATCTA 

CATAAAACATGGGCCGATGAGGAGGATCTCAACAGCCAGAATGAAGCAGAGCTCCGGCGCCAGGTTGAA 

GAACCGCAGCAGGAGACAGAACACGTTTATGAGCTCCTAGACAACAACATTCAGCTGCTGCAGGAGGAA 

TCCAGGTTTGCAAAGGATGAAGCCACACAGATGGAGACTCTGGTGGAGGCAGAGAAGGGGTGTAATCTG 

GAGCTCTCAGAGAGGTGGAAGGATGCTACCAAGAACAGGGAAGATGCACCGGGAGACCAGGTGAAGCTT 

GACCAATATTCTGCGGCACTGGCTCAGAGGGACAGGAGAATTGAAGAGCTGAGGCAGAGCTTGGCTGCC 

CAGGAGGGGCTTGTGGAACAGCTGTCTCGAGAGAAACAACAACTGTTACATCTGCTGGAGGAGCCTGGG 

GGCATGGAAGTGCAGCCCATGCCTAAAGGGTTACCCACGCAACAAAAGCCAGACCTAAATGAGACCCCT 

ACAACCCAGCCATCTGTGTCTGATTCCCACCTGGCAGAACTCCAGGACAAAATCCAGCAAACAGAGGTC 

ACCAACAAGAT7C7TCAAGAGAAACTGAATGACATGAGCTGTGAGCTCAGATCTGCACAGGAGTCGTCT 

CAGAAGCAAGATACGACAATCCAAAGCCTCAAGGAAATGCTAAAGAGCAGGGAAAGTGAGACTGAAGAG 

CTGTACCAGGTGATTGAAGGTCAAAATGACACAATGGCAAAGCTTCCGGAAATGCTACACCAGAGCCAG 

CTCGGACAGCTCCAGAGCTCAGAGGGCATTGCCCCTGCTCAGCAGCAAGTGGCCCTGCTTGACCTTCAG 

AGTGCTCTGTTCTGCAGCCAGCTTGAAATCCAGAAGCTCCAGAGGCTGTTACGCCAGAAAGAGCGTCAG 

CTGGCTGACGGCAAGCGGTGCATGCAATTTGTGGAGGCTGCAGCACAGGAGAGAGAGCAGCAGAAGGAA 

GCTGCTTGGAAACATAACCAGGAATTACGAAAAGCTTTGCAACACCTCCAAGGAGAACTGCACAGTAAG 

AGCCAACAGCTCCACGTTCTGGAGGCAGAAAAATATAATGAAATTCGAACCCAGGGACAAAACATTCAA 

CACCTAAGTCACAGTCTGAGTCACAAAGAGCAGCTAATTCAGGAACTTCAGGAGCTCCTACAGTATCGG 

GATACCACAGACAAAACTCTAGACACAAATGAGGTGTTTCTTGAGAAACTACGGCAACGAATACAAGAC 

CGGGCAGTTGCTCTAGAGCGGGTTATAGATGAAAAGTTCTCTGCTCTAGAAGAAAAGGACAAGGAACTG 

CGGCAGCTCCGGCTTGCTGTGAGGGACCGAGACCATGACTTAGAGAGACTGCGTTGTGTCCTGTCTGCC 

AATGAAGCTACCATGCAAAGTATGGAGAGTCTCCTGAGGGCCAGAGGCCTGGAAGTGGAGCAGTTAATT 

GCCACCTGCCAAAACCTCCAGTGGTTGAAGGAAGAATTGGAAACCAAGTTTGGCCACTGGCAGAAGGAA 

CAGGAGAGCATCATTCAGCAGTTACAGACATCTCTGCATGACAGGAACAAAGAAGTAGAGGATCTCAGT 

GCAACTTTGCTCCACAAACTTGGACCCGGCCAGAGTGAAGTAGCTGAGGAGCTGTGCCAGCGCCTGCAG 

CGGAAGGAAAGGGTGCTGCAGGACCTTCTGAGTGATCGGAACAAACAAGCCATGGAGCACGAGATGGAG 

GTCCAGGGACTGCTCCAGTCGATGGGCACCCGGGAACAGGAAAGACAGGCTGTTGCAGAAAAAATGGTA 

CAAGCCTTCATGGAAAGAAACTCGGAATTACAGGCCCTGCGGCAGTATCTAGGGGGGAAGGAATTAATG 

GCAGCATCTCAGGCATTCATCTCTAACCAACCAGCTGGAGCGACTTCTGTAGGCCCCCACCATGGAGAG 

CAAACTGACCAAGGTTCTACGCAGATGCCCTCTCGAGACGACAGCACCTCGCTGACTGCCAGAGAGGAG 

GCCAGCATACCCCGGTCTACATTAGGAGACTCAGACACAGTTGCAGGGCTGGAGAAAGAACTGAGCAAT 

GCCAAGGAGGAGCTTGAGCTCATGGCCAAAAAAGAAAGAGAAAGCCAGATAGAATTGTCTGCCCTGCAG 

TCCATGATGGCTGTGCAAGAGGAAGAGCTGCAGGTGCAGGCTGCTGACTTGGAGTCCCTGACCAGGAAC 

ATACAGATAAAAGAAGACCTCATAAAGGACCTGCAAATGCAACTGGTTGACCCTGAAGATATGCCAGCC 

ATGGAGCGCCTGACCCAAGAGGTCTTACTTCTTCGGGAAAAAGTTGCTTCAGTGGAACCCCAGGGTCAG 

GAAGGGTCAGAGAACAGGAGACAACAGTTGCTGCTGATGTTAGAAGGACTAGTGGATGAACGGAGTC6G 

CTCAACGAGGCCCTGCAAGCTGAGCGGCAGCTCTACAGCAGCCTGGTCAAGTTCCATGCCCAACCAGAG 

ATCTCTGAGAGAGACCGAACTCTGCAGGTGGAACTGGAAGGGGCCCAGGTGTTACGCAGTCGACTAGAA 

GAAGTTCTTGGAAGAAGCCTGGAGCGCTTAAGCAGGCTGGAGACCCTGGCCGCCATTGGAGGTGCTACT 

GCAGGCGATGAGACTGAAGATACAAGCACACAGTTCACAGACAGCATTGAGGAGGAGGCTGCACyVC^^ 

AGCCACCAGCAACTCATCAAGGTGTCTTTGGAGAAAAGCCTGACCACCATGGAGACCCAGAACACATGT 

CTTCAGCCCCCTTCCCCAGTAGGAGAGGATGGTAACAGGCATCrrCAGGAAGAAATGCTCCACCTGAGG 

GCTGAAATCCACCAGCCCTTAGAAGAGAAGAGAAAAGCTGAGGCAGAACTCAAGGAGCTAAAGGCTCAA 

ATTGAGGAAGCAGGATTCTCCTCTGTGTCCCACATCAGGAACACCATGCTGAGCCTTTGCCTTTGCCTT 

GAGAATGCAGA6CTGAAAGAGCAGATGGGAGAAGCAATGTCTGATGGATGGGAGGTGGAGGAAGACAAG 

GAGAAGGGCGAGGTGATGGTGGAGACCGTGGTGGCCAAAGGGGGTCTGAGTGAGGACAGCCTTCAGGCT 

GAGTTCAGGAAAGTCCAGGGGAGACTCAAGAGTGCCTACAACATCATCAACCTCCTCAAAGAGCAGCTG 

GTCCTGAGAAGCTCGGAAGGGAACACTAAGGAGATGCCAGAGTTCC7CGTGCGCCTGGCCAGGGAGGTG 

GACAGAATGAACATGGGCTTGCCTTCCTCGGAGAAGCATCAACACCAAGAACAGGAGAATATGACCGCA 

AGGCCTGGCCCCAGGCCCCAGAGTCTCAAGCTTGGGACAGCTCTCTCAGTAGACGGCTACCAACTGGAG 

AACAAGTCCCAGGCCCAAGACTCTGGACATCAGCCAGAATTTAGCCTACCAGGGTCCACCAAACACCTG 

CGCTCCCAGCTGGCTCAGTGTAGACAACGGTACCAAGATCTCCAGGAGAAGCTGCTCATCTCAGAAGCC 

ACTGTGTTTGCCCAGGCAAACCAGCTAGAGAAGTACAGAGCCATATTAAGTGAATCCCTGGTGAAGCAG 

GACAGCAAGCAGATCCAGGTGGACCTTCAGGACCTGGGCTATGAGACTTGTGGCCGAAGTGAGAATGAA 

GCTGAACGTGAGGAGACCACCAGCCCTGAGTGTGAGGAGCACGGTAACCTGAAGCCTGTGGTGCTGGTG 

GAAGGCTTGTGCTCTGAGCAAGGGTACCTGGACCCTGTCTTGGTCAGCTCACCTGTGAJi.GAACCCTTGG 

AGAACAAGCCAGGAAGCCAGAAGAATCCAGGCACAAGGAACTTCAGACAA.CACCTCTCTCCTGAGGAAG 

GACATCCGAAA.TCTGAAAGCCCAGCTACCGAATGCCTACAAGGTCCTTCAGAACCTGAOGAGCCGGGTC 
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FIG. 3 (conO 

CGGTCCCTGTCTGCCACAAGCGATTACTCATCGAGTCTGGAGAGACCCCGCAAGCTGATAGCCGTGGCA 

ACCCTTGAGGGGGCCTCACCCCACAGTGTCACTGATGAAGACGAAGGCTTGTTGTCAGATGGCACCGGG 

GCTTTTTACCCTCCAGGGCTCCAGGCCAAAAAGAATCTAGAGAATCTCATCCAGAGAGTATCCCAGCTG 

GAGGCCCAGCTCCCCAAAACTGGACTAGAAGGGAAGCTGGCTGAAGAACTGAAGTCCGCCTCGTGGCCT 

GGAAAATACGATTC7TTGATTCAGGATCAGGCCCGAAAAACTGTCATATCTGCGTCCGAAAATACXAAA 

AGGGAGAAGGATTTGTTTTCTTCTCACCCAACATTCGAAAGATACGTCAAATCTTTTGAAGACCTCCTG 

AGGAACAACGACTTGACTACTTACCTGGGCCAGAGCTTCCGGGAACAACTTAGTTCAAGGCGTTCAGTG 

ACAGACAGGCTGACCAGCAAATTCAGCACAAAGGATCATAAGAGTGAAAAAGAAGAAGTTGGGCTTGAG 

CCACTGGCCTTCAGGTTCAGCAGGGAATTACAGGAGAAAGAGAAAGTGATTGAAGTCCTGCAGGCCAAG 

GTGGATACCCGGTTTTTCTCACCCCCCAGCAGCCATGCTGCGTCTGAGTCCCACCGTTGTGCCAGCAGC 

ACATCTTTCCTGTCGGATGACATAGAAGCCTGCTCTGACATGGACGTAGCCAGCGAGTACACACACTAT 

GAAGAGAAGAAGCCCTCACCCAGTAACTCAGCAGCCAGTGCATCTCAGGGGCTTAAGGGCGAGCCCAGA 

AGCAGCTCCATCAGCTTGCCAACTCCCCAGAACCCCCCTAAGGAGGCCAGCCAGGCCCAGCCAGGCTTT 

CACTTTAACTCCATACCCAAGCCGGCTAGCCTTTCCCAGGCACCAATGCACTTCACTGTACCCAGCTTC 

ATGCCTTTCGGCCCCTCTGGGCCTCCCCTTCTTGGTTGCTGTGAGACACCAGTGGTGTCCTTGGCTGAG 

GCTCAACAAGAGCTGCAGATGCTGCAGAAGCAGCTGGGACGAAGTGTTAGCATTGCCCCTCCCACCTCC 

ACATCCACGTTGCTTAGCAACCACACAGAAGCTAGCTCTCCCCGCTACAGCAACCCTGCTCAGCCCCAC 

TCCCCAGCAAGGGGCACCATAGAGCTGGGCAGAATCCTGGAGCCTGGATACCTGGGCAGCGGCCAGTGG 

GACATGATGAGGCCTCAGAAAGGGAGCATCTCTGGGGAGCTGTCCTCAGGCTCCTCGATGTACCAGCTT 

AACTCCAAACCCACAGGGGCCGACCTGTTGGAAGAGCATTTAGGTGAGATCCGGAACCTGCGCCAGCGC 

CTGGAGGAGTCCATATGTGTCAATGACAGGCTACGGGAGCAGCTGCAGCATAGGCTCAGCTCCACGGCC 

CGAGAAAATGGTTCCACCTCTCACTTCTACAGTCAGGGCCTGGAGTCCATGCCTCAGCTCTACAATGAG 

AACAGAGCCCTCAGGGAAGAAAACCAAAGCCTGCAGACACGGCTCAGTCATGCTTCCAGGGGACACTCC 

CAGGAAGTGGACCACCTGAGGGAGGCTCTGCTTTCCTCAAGTTCCCAGCTCCAGGAGCTGGAGAAGGAG 

CTGGAGCAGCAGAAGGCTGAGCGGCGGCAGCTTCTGGAAGACTTGCAGGAGAAGCAGGATGAGATCGTG 

CATTTCCGAGAGGAGAGGCTGTCCCTCCAGGAAAACAACTCCAGGCTGCAGCACAAGCTGGCCCTCCTG 

CAACAACAGTGTGAGGAGAAACAGCAGCTCTCCCTGTCCCTGCAGTCAGAGCTCCAGATCTACGAGTCC 

CTCTACGAAAATCCTAAGAAGGGCTTGAAAGCCTTCAGCCTAGATTCCTGTTACCAAGTCCCGGGTGAG 

TTGAGCTGCCTGGTGGCAGAGATTCGAGCTCTGAGAGTGCAGTTGGAGCAGAGCATTCAAGTGAACAAC 

CGTCTGCGGCTGCAGCTGGAACAGCAGATGGATCACGGTGCTGGCAAAGCCAGTCTCAGTTCCTGCCCT 

GTTAACCAGAGCTTCTCAGCCAAGGCGGAGCTGGCAAACCAGCAGCCACCCTTCCAAGGTTCAGCTGCT 

TCCCCTCCAGTCCGGGACGTTGGCTTGAATTCTCCACCCGTGGTCCTCCCCAGCAATTCGTGCTCTGTT 

CCTGGCTCAGACTCTGCCATCATCAGTAGGACAAACAATGGTTCGGATGAGTCTGCAGCAACGAAGACC 

CCTCCCAAGATGGAGGTCGATGCTGCTGATGGCCCATTTGCCAGTGGACACGGCAGACACGTCATCGGC 

CATGTGGATGACTACGACGCCCTACAGCAGCAGATTGGGGAAGGGAAGCTGCTGATCCAAAAGATACTG 

TCTCTCACGAGGCCAGCACGCAGCGTCCCTGCACTGGACGCGCAGGGCACAGAGGCACCAGGTACCAAA 

AGTGTCCATGAGCTTCGGAGCAGCGCCAGGGCTCTGAACCACAGCCTAGAAGAGTCAGCTTCCCTCCTC 

ACCATGTTCTGGAGAGCAGCTTTGCCAAACTCTCATGGTTCTGTACTGGTAGGCGAAGAGGGAAACCTG 

ATGGAGAAAGAACTCCTAGACCTGCGAGCCCAAGTGTCCCAACAGCAACAGCTCCTTCAGAGCACTGCT 

GTGCGTCTGAAGACGGCCAACCAGAGGAAGAAAAGCATGGAGCAGTTCATCGTGAGCCATCTGACCAGG 

ACCCATGATGTCTTGAAGAAAGCACGGACTAATTTAGAGATGAAATCCTTCAGGGCCCTGATGTGCACT 

CCAGCCTTGTGA (SEQ ID NO: 03) 
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FIG. 4 
Human oyomagalln cDNK 

1 GGATCCTTGA GGGCACTGGT GCGACTTTCA GGTGAGGTCT TAGCAGATGA 

51 AAGCGGCTGG CTGTGGCCCG CGCCAGTAGT GCTTTCTGCT CCGCAC7CGC 

101 CGTGAGCCAG GTGTGCAACC GGATTTGGGG CGAGGGTCGC GCTGGCTACC 

151 TCGCATGCGC AGAGCCGGAA GCCCGCTGAC CGGACTACAG CTCCCAGAAG 

201 AGCCTTGTGG AGGCCGCAGA CGCGAAGCCG CTGGCGCCAT CTTGAAATCT 

251 GATCCTCCAT CCCCGAGGCT TTGCGTCTGC GCGGCCGGCC GCTGCTGCTC 

301 CGGGAGCCCA GTCTGCTAAA AGGGGAGGAC GTTGAGGACG CGGCGGCTGG 

351 CGGGAGAGAC AGCTGGGGAG AGACATGGCA GGGTCGGAGC GCGGCCTGCG 

401 CCTCTGTCAC TCAGCATCCT CTTAGGCGTT TCCACGCCCG CCCCCTGCCC 

451 GAGGGGCGGG GCTGACGGCT CTGGTACCCG GAGTCGGCGC GCGGGGCAGG 

501 GGCGCGCCCC TGCAGAGTGG GGACCCCACT GGGCTGTGCC ATGCTGACCG 

551 GAGACCACCG AGGCGGGAGA CAGAGCGCGG CGAAGAGCCA TTGAGTGGTC 

601 ACCCAGTAGC CGCCGCCGCC GCCGCCTCGG GAAGCTTGCC ACCCGCTAGG 

651 AGGGAAGATG AAGGAGATTT GCAGGATCTG TGCCCGAGAG CTGTGTGGAA 

701 ACCAGCGGCG CTGGATCTTC CACACGGCGT CCAAGCTCAA TCTCCAGGTT 

751 CTGCTTTCGC ACGTCTTGGG CAAGGATGTC CCCCGCGATG GCAAAGCCGA 

801 GTTCGCTTGC AGCAAGTGTG CTTTCATGCT TGATCGAATC TATCGATTCG 

851 ACACAGTTAT TGCCCGGATT GAAGCGCTTT CTATTGAGCG CTTGCAAAAG 

901 CTGCTACTGG AGAAGGATCG CCTCAAGTTC TGCATTGCCA GTATGTATCG 

951 GAAGAATAAC GATGACTCTG GCGCGGAGAT CAAGGCGGGG AATGGGACGG 

1001 TTGACATGTC CGTCTTACCC GATGCGAGAT ACTCTGCACT GCTCCAGGAG 

1051 GACTTCGCCT ATTCAGGGTT TGAGTGCTGG GTGGAGAATG AGGATCAGAT 

1101 CCAGGAGCCA CACAGCTGCC ATGGTTCAGA AGGCCCTGGA AACCGACCCA 

1151 GGAGATGCCG TGGTTGTGCC GCTTTGCGGG TTGCTGATTC TGACTATGAA 

1201 GCCATTTGTA AGGTACCTCG AAAGGTGGCC AGAAGTATCT CCTGCGGCCC 

1251 TTCTAGCAGG TGGTCGACCA GCATTTGCAC TGAAGAACCA GCGTTGTCTG 

1301 AGGTTGGGCC ACCCGACTTA GCAAGCACAA AGGTACCCCC AGATGGAGAA 

1351 AGCATGGAGG AAGAGACGCC TGGTTCCTCT GTGGAATCTT TGGATGCAAG 

1401 CGTCCAGGCT AGCCCTCCAC AACAGAAAGA TGAGGAGACT GAGAGAAGTG 

1451 CAAAGGAACT TGGAAAGTGT GACTGTTGTT CAGATGATCA GGCTCCGCAG 

1501 CATGGGTGTA ATCACAAGCT GGAATTAGCT CTTAGCATGA TTAAAGGTCT 

1551 TGATTATAAG CCCATCCAGA GCCCCCGAGG GAGCAGGCTT CCGATTCCAG 

1601 TGAAATCCAG CCTACCTGGA GCCAAGCCTG GCCCTAGCAT GACAGATGGA 

1651 GTTAGTTCCG GTTTCCTTAA CAGGTCTTTG AAACCCCTTT ACAAGACACC 

1701 TGTGAGTTAT CCCTTGGAGC TTTCAGACCT GCAGGAGCTG TGGGATGATC 

1751 TCTGTGAAGA TTATTTGCCG CTCCGGGTCC AGCCCATGAC TGAAGAGTTG 

1801 CTGAAACAAC AAAAGCTGAA TTCACATGAG ACCACTATAA CTCAGCAGTC 

1851 TGTATCTGAT TCCCACTTGG CAGAACTCCA GGAAAAAATC CAGCAAACAG 

1901 AGGCCACCAA CAAGATTCTT CAAGAGAAAC TTAATGAAAT GAGCTATGAA 

1951 CTAAAGTGTG CTCAGGAGTC GTCTCAAAAG CAAGATGGTA CAATTCAGAA 

2001 CCTCAAGGAA ACTCTGAAAA GCAGGGAACG TGAGACTGAG GAGTTGTACC 

2051 AGGTAATTGA AGGTCAAAAT GACACAATGG CAAAGCTTCG AGAAATGCTG 

2101 CACCAAAGCC AGCTTGGACA ACTTCACAGC TCAGAGGGTA CTTCTCCAGC 

2151 TCAGCAACAG GTAGCTCTGC TTGATCTTCA GAGTGCTTTA TTCTGCAGCC 

2201 AACTTGAAAT ACAGAAGCTC CAGAGGGTGG TACGACAGAA AGAGCGCCAA 

2251 CTGGCTGATG CCAAACAATG TGTGCAATTT GTAGAGGCTG CAGCACACGA 

2301 GAGTGAACAG CAGAAAGAGG CTTCTTGGAA ACATAACCAG GAATTGCGAA 

2351 AAGCCTTGCA GCAGCTACAA GAAGAATTGC AGAATAAGAG CCAACAGCTT 

2401 CGTGCCTGGG AGGCTGAAAA ATACAATGAG ATTCGAACCC AGGAACAAAA 

2451 CATCCAGCAC CTAAACCATA GTCTGAGTCA CAAGGAGCAG TTGCTTCAGG 

2501 AATTTCGGGA GCTCCTACAG TATCGAGATA ACTCAGACAA AACCCTTGAA 

2551 GCAAA7GAAA TGTTGCTTGA GAAACTTCGC CAGCGAATAC ATGATAAAGC 

2601 TGTTGCTCTG GAGCGGGCTA TAGATGAAAA ATTCTCTGCT CTAGAAGAGA 

2651 AAGAAAAAGA ACTGCGCCAG CTTCGTCTTG CTGTGAGAGA GCGAGATCAT 

2701 GACTTAGAGA GACTGCGCGA TGTCCTCTCC TCCAATGAAG CTACTATGCA 

27 51 AAGTATGGAG AGTCTCCTGA GGGCC AAAGG CCTGGAAGTG GAACAGTTAT 

2801 CTACTACCTG TCAAAACCTC CAGTGGCTGA AAGAACA.AAT GGAAACCAAA 

2851 TTTACCCGTT GGCAGAAGGA ACAAGAGAGT ATCAT7CAGC AGTTACAGAC 

2901 GTCrCTTCAT GATAGGAACA AAGAAGTGGA GGATCTTAGT GCAACACTGC 
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FIGURE 4 (com) 

2951 TCTGCAAACT TGGACCAGGG CAGAGTGAGA TAGCAGAGGA GCTG7GCCAG 
3001 CGTCTACAGC GAAAGGAAAG GATGCTGCAG GACCTTCTAA GTGATCGAAA 
3051 TAAACAAGTG CTGGAACATG AAATGGAGAT TCAAGGCCTG CTTCAGTCTG 
3101 TGAGCACCAG GGAGCAGGAA AGCCAAGCTG CTGCAGAGAA GTTGGTGCAA 
3151 GCCTTAATGG AAAGAAATTC AGAATTACAG GCCCTGCGCC AATATTTAGG 
3201 AGGGAGAGAC TCCCTGATGT CCCAAGCACC CATCTCTAAC CAACAAGCTG 
3251 AAGTTACCCC CACTGGCCGT CTTGGAAAAC AGACTGATCA AGGTTCAATG 
3301 CAGATACCTT CCAGAGATGA TAGCACTTCA TTGACTGCCA AAGAGGATGT 
3351 CAGCATACCC AGATCCACAT TAGGAGACTT GGACACAGTT GCAGGGCTGG 
3401 AAAAAGAACT GAGTAATGCC AAAGAGGAAC TTGAACTCAT GGCTAAAAAA 
3451 GAAAGAGAAA GTCAGATGGA ACTTTCTGCT CTACAGTCCA TGATGGCTGT 
3501 GCAGGAAGAA GAGCTGCAGG TGCAGGCTGC TGATATGGAG TCTCTGACCA 
3551 GGAACATACA GATTAAAGAA GATCTCATAA AGGACCTGCA AATGCAACTG 

3601 GTTGATCCTG AAGACATACC AGCTATGGAA CGCCTGACCC AGGAAGTCTT 

3651 ACTTCTTCGG GAAAAAGTTG CTTCAGTAGA ATCCCAGGGT CAAGAAATTT 

3701 CAGGAAACCG AAGACAACAG TTGCTGCTGA TGCTAGAAGG ACTAGTAGAT 

3751 GAACGGAGTC GGCTCAATGA GGCCTTACAA GCAGAGAGAC AGCTCTATAG 

3801 CAGTCTGGTG AAGTTCCATG CCCATCCAGA GAGCTCTGAG AGA6ACCGAA 

3851 CTCTGCAGGT GGAACTGGAA GGGGCTCAGG TGTTACGCAG TCGGCTAGAA 

3901 GAAGTTCTTG GAAGAAGCTT GGAGCGCTTA AACAGGCTGG AGACCCTGGC 

3951 CGCCATTGGA GGTGCAGCTG CAGGGGATGA CACCGAAGAT ACAAGCACTG 

4001 AGTTCACTGA CAGTATTGAG GAGGAGGCTG CACACCATAG TCACCAGCAA 

4051 CTTGTCAAGG TGGCTTTGGA GAAAAGTCTG GCAACTGTGG AGACCCAGAA 

4101 CCCATCTTTT TCCCCTCCTT CTCCGATGGG AGGGGACAGT AACAGGTGTC 

4151 TTCAGGAAGA AATGCTCCAC CTGAGGGCTG AGTTCCACCA GCACTTAGAA 

4201 GAGAAGAGGA AAGCTGAGGA GGAACTGAAG GAGCTAAAGG CTCAAATTGA 

4251 GGAAGCAGGA TTCTCCTCAG TGTCCCACAT CAGGAACACC ATGCTGAGCC 

4301 TTTGCCTTGA GAATGCGGAG CTGAAAGAGC AGATGGGAGA AGCAATGTCT 

4351 GATGGATGGG AGATCGAGGA AGACAAGGAG AAGGGCGAGG TGATGGTTGA 

4401 GACTGTGGTA ACCAAAGAGG GTCTGAGTGA GAGTAGCCTT CAGGCTGAGT 

4451 TCAGAAAGCT CCAGGGAAAA CTGAAGAATG CCCACAATAT CATCAACCTC 

4501 CTCAAAGAAC AACTTGTGCT GAGTAGCAAG GAAGGGAATA GTAAACTTAC 

4551 TCCAGAGCTC CTTGTGCATC TGACCAGCAC CATTGAAAGA ATAAACACAG 

4601 AACTGGTTGG TTCCCCTGGG AAGCACCAAC ACCAAGAGGA GGGGAATGTG 

4651 ACTGTGAGGC CTTTCCCCAG ACCCCAGAGC CTTGACCTTG GGGCTACCTT 

4701 CACAGTGGAT GCCCACCAAT TGGATAACCA GTCCCAGCCT CGTGACCCTG 

4751 GGCCTCAGTC AGCGTTTAGC CTACCAGGGT CCACCCAGCA CCTGCGCTCC 

4801 CAGCTGTCAC AATGCAAACA ACGCTATCAA GATCTCCAGG AGAAGCTGCT 

4851 GCTATCAGAA GCCACTGTCT TTGCTCAGGC TAACGAGCTG GAGAAATACA 

4 901 GAGTTATGCT TACAGGTGAA TCCTTGGTGA AGCAGGACAG CAAGCAGATC 

4951 CAGGTGGACC TCCAGGACCT GGGCTATGAG ACTTGTGGCC 6AAGCGAGAA 

5001 TGAGGCTGAA CGGGAGGAAA CCACCAGTCC TGAGTGTGAG GAGCACAACA 

5051 GCCTCAAGGA AATGGTCCTG ATGGAGGGGC TGTGCTCTGA GCAGGGACGC 

5101 CGGGGCTCAA CACTGGCTAG TTCCTCTGAG AGGAAGCCCT TGGAGAACCA 

5151 GCTAGGGAAG CAGGAAGAGT TCCGGGTATA TGGAAAGTCA GAAAACATCT 

5201 TGGTCCTACG AAAGGACATC AAAGATCTGA AGGCCCAGCT GCAGAATGCC 

5251 AACAAGGTCA TTCAAAACCT CAAGAGCCGG GTCCGGTCCC TCTCAGTTAC 

5301 AAGTGATTAT TCGTCTAGTC TGGAAAGACC CCGGAAGCTG AGAGCTGTTG 

5351 GCACCTTGGA GGGGTCTTCA CCTCATAGTG TCCCTGATGA GGATGAGGGG 

5401 TGGCTGTCTG ATGGCAC7GG GGCTTTCTAC TCTCCAGGGC TTCAGGCCAA 

5451 AAAGGACCTG GAGAGTCTCA TCCAGAGAGT ATCCCAGCTG GAGGCCCAGC 

5501 TCCCAAAAAA TGGACTAGAA GAGAAGCTGG CTGAGGAGCT GAGATCAGCC 

5551 TCGTGGCCTG GGAAATATGA TTCCCTGATT CAGGATCAGG CCCGGGAACT 

5601 GTCTTACCTA CGGCAAAAAA TACGAGAAGG GAGAGGTATT TGTTATCTTA 

5651 TCACCCGGCA TGCAAAAGAT ACAGTAAAAT CTTTTGAGGA TCTCCTAAGG 

5701 AGCAATGACA TTGACTACTA CCTGGGACAG AGCTTCCGGG AGCAACTCGC 

57 SI CCAGGGAAGC CAGCTGACAG AGAGGCTCAC CAGCAAACTC AGCACCAAGG 

560 1 ATCATAAAAG TGAGAAAGAT CAAGCTGGAC TTGAGCCACT GGGGCTCAGG 

5331 CTCAGCAGGG AGCTGCAGGA GAAGGAGAAA GTGATTGAAG TCCTGCAGGC 

59C: CAAGCTGGAT GCTCGGTCCC TCACACCCTC CAGCAGCCAT GCCTTGTCTG 

59=: ACTCCCACCG CTCTCCCAGC AGCACCTCTT TCCTGTCTGA TGAACTGGAA 
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FIGURE 4(C0NT) 

6001 GCCTGCTCTG ACATGGACAT AGTCAGCGAG TACACACACT ATGAAGAGAA 
6051 GAAAGCTTCT CCCAGTCACT CAGATTCCAT CCATCATTCG AGTCATTCTG 
6101 CTGTGTTGTC TTCTAAACCA TCATCAACCA GTGCATCTCA GGGGGCTAAG 
6151 GCCGAATCCA ACAGCAACCC CATCAGCTTG CCAACTCCCC AGAATACCCC 
6201 CAAGGAGGCC AACCAGGCCC ATTCAGGCTT TCATTTTCAC TCCATACCCA 
6251 AGCTGGCTAG CCTTCCTCAG GCACCATTGC CCTCAGCTCC ATCCAGCTTC 
6301 CTGCCTTTCA GCCCCACTGG CCCTCTCCTC CTTGGCTGCT GTGAGACACC 
6351 AGTGGTCTCC TTGGCTGAGG CTCAGCAGGA GCTACAGATG CTGCAGAAGC 
6401 AGTTGGGAGA AAGTGCCAGC ACTGTTCCTC CTGCTTCCAC AGCTACATTG 
64 51 CTGAGCAACG ACTTGGAAGC CGACTCTTCC TACTACCTCA ACTCTGCCCA 
6501. GCCTCACTCT CCTCCAAGGG GCACCATAGA ACTGGGAAGA ATCCTAGAGC 
6551 CTGGGTACCT GGGCAGCAGT GGCAAGTGGG ATGTGATGAG GCCTCAGAAA 
6601 GGGAGTGTAT CTGGGGACCT ATCCTCAGGC TCCTCTGTGT ACCAGCTTAA 
6651 CTCCAAACCC ACAGGGGCTG ACCTGCTGGA AGAGCATCTT GGTGAAATCC 
6701 GGAACCTGCG CCAGCGCCTG GAGGAGTCCA TCTGCATCAA TGACCGCCTA 
6751 CGGGAGCAAC TGGAACACCG GCTGACCTCT ACTGCTCGTG GAAGGGGATC 
6801 CACTTCTAAC TTCTACAGTC AGGGCCTGGA GTCCATACCT CAGCTCTGCA 
6851 ATGAGAACAG AGTCCTCAGG GAAGACAATC GAAGACTTCA GGCTCAACTG 
6901 AGTCATGTTT CCAGAGAGCA CTCCCAGGAA ACAGAAAGCC TGAGGGAGGC 
6951 TCTGCTGTCC TCTCGATCCC ACCTTCAAGA GCTGGAAAAG GAGCTGGAGC 
7001 ACCAGAAGGT GGAAAGGCAG CAGCTTTTGG AAGACTTGAG GGAGAAGCAG 
7051 CAAGAGGTCT TGCATTTCAG GGAGGAACGT CTTTCCCTCC AGGAAAACGA 
7101 CTCCAGTGGG CCTTGCCTCT CCCTGGTCAG ACTGCAGCAC AAGCTGGTTC 
7151 TCCTGCAGCA ACAGTGTGAA GAGAAACAGC AGCTCTTTGA GTCCCTCCAG 
7201 TCAGAGCTAC AAATCTACGA GGCACTTTAT GGCAATTCCA AGAAGGGGCT 
7251 GAAAGCTTAC AGCCTGGATG CCTGTCACCA AATCCCTTTG AGCAGTGACC 
7301 TGAGCCACCT GGTGGCAGAG GTACGAGCTC TGAGAGGGCA GCTGGAGCAG 
7351 AGCATTCAGG GGAACAATTG TCTGCGACTG CAGCTGCAAC AGCAGCTGGA 
7401 GAGCGGTGCT GGCAAAGCCA GCCTCAGCCC CTCCTCCATT AACCAGAACT 
7451 TCCCAGCCAG CACTGACCCT GGAAACAAGC AGCTGCTCCT CCAAGATTCA 
7501 GCTGTGTCCC CTCCAGTCCG GGATGTTGGT ATGAATTCCC CAGCTCTGGT 
7551 CTTCCCCAGC TCTGCTTCCT CTACTCCTGG CTCAGAAACG CCCATAATCA 
7601 ACAGAGCAAA TGGCTTGGGT TTGGATACTT CTCCAGTAAT GAAGACCCCT 
7651 CCCAAGCTAG AGGGT6ATGC TACTGATGGC TCCTTTGCCA ATAAGCATGG 
7701 CCGCCATGTC ATTGGCCACA TTGATGACTA CAGTGCCCTA AGACAGCAGA 
7751 TTGCGGAGGG CAAGCTGCTG GTCAAAAAGA TAGTGTCTCT TGTGAGATCA 
7801 GCGTGCAGCT TCCCTGGCCT TGAAGCCCAA GGCACAGAGG TGCTAGGCAG 
7851 CAAAGGTATT CATGAGCTTC GGAGCAGCAC CAGTGCCCTG CACCATGCCC 
7901 TAGAGGAGTC GGCTTCCCTC CTCACCATGT TCTGGAGAGC AGCCCTGCCA 
7951 AGCACCCACA TCCCTGTGCT GCCTGGCAAA GTGGGAGAAT CAACAGAAAG 
8001 GGAACTTCTG GAACTGAGAA CCAAAGTATC CAAACAGGAG CGGCTCCTTC 
8051 AGAGCACAAC TGAGCATCTG AAGAACGCCA ACCAGCAGAA GGAGAGCATG 
8101 GAGCAGTTCA TCGTCAGCCA GCTAACCAGA ACACATGATG TTTTAAAGAA 
8151 GGCAAGGACT AACTTAGAGG TGAAATCCCT AAGGGCTCTG CCATGTACTC 
8201 CAGCCTTGTG ACCCTTGCCT TCCAGGAACC ATGCAAGAAG CGCAGCCACC 
8251 AGAAGTCCTT AAAACAGCAG GAAAGGTGGG CCTGTCCCCC TTTTGTGCAG 
8301 CTACCTATCT GCTGAGGAGC ATCTGGGCCT CATTCCTCCA AGTCCACGGG 
8351 AGGGTCCAGA AGAGGGAGTC AGAGATGTAT CCTGGTGGAG CTGGGAGAAA 
8401 GGCAGAAAGC CTTTCTGACA GCTATGGAAT ACGATTAGCC AAGGTCCACT 
8451 TGGCCCAGCA CTAAGAAAAA GATGCGTAGT TTGCACAGAA GGTTTTGTGA 
8501 TCCTGCCTCT CAACAGCCCC AGQAGCTTGG GAACTAGCAA GAGCACATTT 
8551 CTTGCCTCAT CAGCTGTCCT GAGATGGAAA ACTCAGTGGA TATAGGACCC 
8601 TGATTCCGAT GAAAGGGGCA CGTGGTCCCA ATGCTGGAGC TCCTCTGGCA 
8651 GGTTCTAAAA GCACACTACT GAGCAGCGGT GCCCTGCCGG ACACTGCTGG 
8701 CGGGGGCTCA GTGAGCACTA CTCACAGATC CACACCTGAC CCTGTTGGGT 
8751 CGAGTCAGGC TGGGCCTTGG TCTGCACTGT AGCACCTGTG TTCTTTGAGT 
8801 TCACATCATG AATGTGGTGA CTTCCCAGAT ACCATCTCAG GCTTAACCTA 
8951 GCACA7CCTA TTTCTTTTCT TCTATGATAT CCAAATTGGA CTGACCTCAC 
8901 TTCAAAGTTG CTGTCCCATT TTGTCACCCT ATCTTATCTC GGGGAAATTG 
8951 CAGACTGATG GCCAGACCAA CTCTGTTG;^^ ATTCTTGCAT AGAGCAAACC 
9001 TGTGCTCATT TTTAAGTGGC ATGGGAGAGG CCCCCAGCCT AGTAAAGCCT 
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FIGURE 4 (CONT) 

9051 AGTCTGTGTC TTCACAGTGC TGGTAGAATG TGTTTGTGTG TATAAATATA 

9101 TGATATAGAT TTATATATGT TGCTAACGCC ATATATTGAA GGCCAACATA 

9151 ACTGGTGGAC AGGGTGGGTG ACAGAAAATG AAAGCCTTTT TGGTGATTGT 

9201 TAAAGCAAGA TGTGTATAAA GAAATAAATA GTTTTTCTTT C (SEQ ID NO: 04) 
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FIG. 5 

>Human myomegaiin procein 

1 MKEICRICAR ELCGNQRRWI FHTASKLNLQ VLLSHVLGKD VPRDGKAEFA 

51 CSKCAFMLDR lYRFDTVIAR lEALSIERLQ KLLLEKDRLK FCIASMYRKN 

101 NDDSGAEIKA GNGTVDMSVL PDARYSALLQ EDFAYSGFEC WVENEDQIQE 

151 PHSCHGSEGP GNRPRRCRGC AALRVADSDY EAICKVPRKV ARSISCGPSS 

201 RWSTSICTEE PALSEVGPPD LASTKVPPDG ESMEEETPGS SVESLDASVQ 

251 ASPPQQKDEE TERSAKELGK CDCCSDDQAP QHGCNHKLEL ALSMIKGLDY 

301 KPIQSPRGSR LPIPVKSSLP GAKPGPSMTD GVSSGFLNRS LKPLYKTPVS 

351 YPLELSDLQE LWDDLCEDYL PLRVQPMTEE LLKQQKLNSH ETTITQQSVS 

401 DSHLAELQEK IQQTEATNKI LQEKLNEMSY ELKCAQESSQ KQEX^TIQNLK 

451 ETLKSRERET EELYQVIEGQ NDTMAKLREM LHQSQLGQLH SSEGTSPAQQ 

501 QVALLDLQSA LFCSQLEIQK LQRWRQKER QLADAKQCVQ FVEAAAHESE 

551 QQKEASWKHN QELRKALQQL QEELQNKSQQ LRAWEAEKYN EIRTQEQNIQ 

601 HLNHSLSHKE QLLQEFRELL QYRDNSDKTL EANEMLLEKL RQRIHDKAVA 

651 LERAIDEKFS ALEEKEKELR QLRLAVRERD HDLERLRDVL SSNEATMQSM 

701 ESLLRAKGLE VEQLSTTCQN LQWLKEEMET KFSRWQKEQE SIIQQLQTSL 

751 HDRNKEVEDL SATLLCKLGP GQSEIAEELC QRLQRKERML QDLLSDRNKQ 

801 VLEHEMEIQG LLQSVSTREQ ESQAAAEKLV QALMERNSEL QALRQYLGGR 

851 DSLMSQAPIS NQQAEVTPTG RLGKQTDQGS MQIPSRDDST SLTAKEDVSI 

901 PRSTLGDLDT VAGLEKELSN AKEELELMAK KERESQMELS ALQSMMAVQE 

951 EELQVQAADM ESLTRNIQIK EDLIKDLQMQ LVDPEDIPAM ERLTQEVLLL 

1001 REKVASVESQ GQEISGNRRQ QLLLMLEGLV DERSRLNEAL QAERQLYSSL 

1051 VKFHAHPESS ERDRTLQVEL EGAQVLRSRL EEVLGRSLER LNRLETLAAI 

1101 GGAAAGDDTE DTSTEFTDSI EEEAAHHSHQ QLVKVALEKS LATVETQNPS 

1151 FSPPSPMGGD SNRCLQEEML HLRAEFHQHL EEKRKAEEEL KELKAQIEEA 

1201 GFSSVSHIRN TMLSLCLENA ELKEQMGEAM SDGWEIEEDK EKGEVMVETV 

1251 VTKEGLSESS LQAEFRKLQG BaKNAHNIIN LLKEQLVLSS KEGNSKLTPE 

1301 LLVHLTSTIE RINTELVGSP GKHQHQEEGN VTVRPFPRPQ SLDLGATFTV 

1351 DAHQLDNQSQ PRDPGPQSAF SLPGSTQHLR SQLSQCKQRY QDLQEKLLLS 

1401 EATVFAQANE LEKYRVMLTG ESLVKQDSKQ IQVDLQDLGY ETCGRSENEA 

1451 EREETTSPEC EEHNSLKEMV LMEGLCSEQG RRGSTLASSS ERKPLENQLG 

1501 KQEEFRVYGK SENILVLRKD IKDLKAQLQN ANBCVIQNLKS RVRSLSVTSD 

1551 YSSSLERPRK LRAVGTLEGS SPHSVPDEDE GWLSDGTGAF YSPGLQAKKD 

1601 LESLIQRVSQ LEAQLPKNGL EEKLAEELRS ASWPGKYDSL IQDQARELSY 

1651 LRQKIREGRG ICYLITRHAK DTVKSFEDLL RSNDIDYYLG QSFREQLAQG 

1701 SQLTERLTSK LSTKDHKSEK DQAGLEPLAL RLSRELQEKE KVIEVLQAKL 

1751 DARSLTPSSS HALSDSHRSP SSTSFLSDEL EACSDMDIVS EYTHYEEKKA 

1801 SPSHSDSIHH SSHSAVLSSK PSSTSASQGA KAESNSNPIS LPTPQNTPKE 

1851 ANQAHSGFHF HSIPKIASLP QAPLPSAPSS FLPFSPTGPL LLGCCETPW 

1901 SLAEAQQELQ MLQKQLGESA STVPPASTAT LLSNDLEADS SYYLNSAQPH 

1951 SPPRGTIELG RILEPGYLGS SGKWDVMRPQ KGSVSGDLSS GSSVYQLNSK 

2001 PTGADLLEEH LGEIRNLRQR LEESICINDR LREQLEHRLT STARGRGSTS 

2051 NFYSQGLESI PQLCNENRVL REDNRRLQAQ LSHVSREHSQ ETESLREALL 

2101 SSRSHLQELE KELEHQECVER QQLLEDLREK QQEVLHFREE RLSLQENDSS • 

2151 GPCLSLVRLQ HKLVLLQQQC EEKQQLFESL QSELQIYEAL YGNSKKGLKA 

2201 YSLDACHQIP LSSDLSHLVA EVRALRGQLE QSIQGNNCLR LQLQQQLESG 

2251 AGKASLSPSS INQNFPASTD PGNKQLLLQD SAVSPPVRDV GMNSPALVFP 

2301 SSASSTPGSE TPIINRANGL GLDTSPVMKT PPKLEGDATD GSFANKHGRH 

2351 VIGHIDDYSA LRQQIAEGKL LVKKIVSLVR SACSFPGLEA QGTEVLGSKG 

2401 IHELRSSTSA LHHALEESAS LLTMFWRAAL PSTHIPVLPG KVGESTEREL 

24 51 LELRTKVSKQ ERLLQSTTEH LKNANQQKES MEQFIVSQLT RTHDVLKKAR 

2501 TNLEVKSLRA LPCTPAL (SEQ ID NO; 05) 
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12/12 
FIGURE 6 

M14 PROTEIN 

MMAQFPTAMNGGPNMWAITSEERTKHDKQFDNLKPSGGYITGDQARTFFLQSGLPAPVL 

AEIWALSDLNKDGKMDQQEFSIAMKLIKLKLQGQQLPVVLPPIMKQPPMFSPLISARFG 

MGSMPNLSIHQPLPPVAPITAPLSSATSGTSIPPLMMPAPLVPSVSTSSLPNGTASLIQ 

PLSIPYSSSTLPHASSYSLMMGGFGGASIQKAQSLIDLGSSSSTSSTASLSGNSPKTGT 

SEWAVPQPSRLKYRQKFNSLDKSMSGYLSGFQARNALLQSNLSQTQLATIWTLADIDGD 

GQLKAEEFILAMHLTDf4AKAGQPLPLTLPPELVPPSFRGGKQIDSINGTLPSYQKTQEE 

EPQKKLPVTFEDKRECANYERGNMELEKRRQVLMEQQQREAERKAQKEKEEWERKQRELQ 

EQEWKKQLELEKRLEKQRELERQREEERRKEIERREAAKQELERQRRLEWERIRRQELL 

NQKNREQEEIVRLNSKKKSLHLELEAVNGKHQQISGRLQDVRIRKQTQKTELEVLDKQC 

DLEIMEIKQLQQELQEYQNKLIYLVPEKQLLNERIKNMQLSNTPDSGISLLHKKSSEKE 

ELCQRLKEQLDALEKETASKLSEMDSFNNQLKCGNMDDSVLQCLLSLLSCLNNLFLLLK 

ELRESYNTQQLALEQLHKIKRDKLKELERKRLEQIQKKKLEDEAARKAKQGKENLWKES 

IRKEEEEKQKRLQEEKSQDRTQEEERKTEAKQSETARALVNYRALYPFEARNHDEMSFN 

SGDIIQVDEKTVGEPGWLYGSFQGKFGWFPCNYVEKMLSSDKTPSPKKALLPPAVSLSA 

TSAAPQPLCSNQPAPVTDYQNVSFSNLNVNTTWQQKSAFTRTVSPGSVSPIHGQGQAVE 

NLKAQALCSWTAKKENHLNFSKHDVITVLEQQENWWFGEVHGGRGWFPKSYVKIIPGSE 

VKRGEPEALYAAVNKKPTSTAYPVGEEYIALYSYSSVEPGDLTFTEGEELLVTQKDGEW 

WTGSIGERTGIFPSNYVRPKDQENVGNASKSGASNKKPEIAQVTSAYAASGAEQLSLAP 

GQLILILKKNSSGWWQGELQARGKKRQKGWFPASHVKLLGPSAERTTPAFHAVCQVIAM 

YDYIANNEDELNFSKGQLINVMNKDDPDWWQGEINGVTGLFPSNYVKMTTDSDPSQQWC 

ADLQALDTMQPMERKRQGYIHELIETEERYMDDLQLVIEVFQKRMAESGFLTEAEMALI 

FVNWKELIMSNTKLLKALRVRKKTGGEKMPVEMMGDILAAELSHMQAYIRFCSCQLNGA 

ALLQQKTDEDADFBCEFLKKLASDPRCKGMPLSSFLLKPMQRITRYPLLIRSILENTPQN 

HVDHSSLKLALERAEELCSQVNEGVREKENSDRLEWIQAHVQCEGLAEQLIFNSLTNCL 

GPRKLLYSGKLYKTKSNKELHGFLFNDFLLLTYLVRQFAASSGFEKLFSSKSSAQFKMY 

KTPIFLNEVLVKLPTDPSSDEPVFHISHIDRVYTLRTDNINERTAWVQKIECAASEQYID 

TEKKKREKAYQARSQKTSGIGRLMVHVIEATELKACKPNGKSNPYCEISMGSQSYTTRT 

LQDTLNPKWNFNCQFFIKDLYQDVLCLTMFDRDQFSPDDFLGRTEVPVAKIRTEQESKG 

PTTRRLLLHEVPTGEVWVRFDLQLFEQKTLL (SEQ ID NO: 08) 
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SEQUENCE LISTING 



<110> Conti, Marco 

Pahlke, Gudrun 



<120> Novel Phosphodiesterase Interacting 
Proteins 

<130> SUN-IOIPCT 

<140> 60/108,255 
<141> 1998-11-12 

<160> 8 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 2326 
<212> PRT 
<213> rat 

<4 00> 1 

Met Ser Asn Gly Tyr Arg Thr Leu Ser Gin His Leu Asn Asp Leu Lvs 

1^5 10 15 

Lys Glu Asn Phe Ser Leu Lys Leu Arg He Tyr Phe Leu Glu Glu Arg 

20 25 30 

Met Gin Gin Lys Tyr Glu Val Ser Arg Glu Asp Val Tyr Lys Arg Asn 

35 40 45 

He Glu Leu Lys Val Glu Val Glu Ser Leu Lys Arg Glu Leu Gin Asd 

50 55 60 

Arg Lys Gin His Leu His Lys Thr Trp Ala Asp Glu Glu Asp Leu Asn 

70 75 80 

Ser Gin Asn Glu Ala Glu Leu Arg Arg Gin Val Glu Glu Pro Gin Gin 

85 90 95 

Glu Thr Glu His Val Tyr Glu Leu Leu Asp Asn Asn He Gin Leu Leu 

100 105 110 

Gin Glu Glu Ser Arg Phe Ala Lys Asp Glu Ala Thr Gin Met Glu Thr 

115 120 125 

Leu Val Glu Ala Glu Lys Gly Cys Asn Leu Glu Leu Ser Glu Ara Tro 

130 135 140 

Lys Asp Ala Thr Lys Asn Arg Glu Asp Ala Pro Gly Asp Gin Val Lvs 
145 150 155 160 

Leu Asp Gin Tyr Ser Ala Ala Leu Ala Gin Arg Asp Arg Arg He Glu 

165 170 175 

Glu Leu Arg Gin Ser Leu Ala Ala Gin Glu Gly Leu Val Glu Gin Leu 

180 185 190 

Ser Arg Glu Lys Gin Gin Leu Leu His Leu Leu Glu Glu Pro Gly Gly 

195 200 205 

Met Glu Val Gin Pro Met Pro Lys Gly Leu Pro Thr Gin Gin Lys Pro 

210 215 220 

Asp Leu Asn Glu Thr Pro Thr Thr Gin Pro Ser Val Ser Asp Ser His 
225 230 235 240 

Leu Ala Glu Leu Gin Asp Lys He Gin Gin Thr Glu Val Thr Asn Lys 

245 250 255 

He Leu Gin Glu Lys Leu Asn Asp M^t Ser Cys Glu Leu Arg Ser Ala 

260 265 270 

Gin Glu Ser Ser Gin Lys Gin Asp Thr Thr He Gin Ser Leu Lys Glu 

275 280 285 

Met Leu Lys Ser Arg Glu Ser Glu Thr Glu Glu Leu Tyr Gin Val He 

290 295 300 

Glu Gly Gin Asn Asp Thr Met Ala Lys Leu Pro Glu Met Leu His Gin 
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305 310 315 320 

Ser Gin Leu Gly Gin Leu Gin Ser Ser Glu Gly lie Ala Pro Ala Gin 

325 330 335 

Gin Gin Val Ala Leu Leu Asp Leu Gin Ser Ala Leu Phe Cys Ser Gin 

340 345 350 

Leu Glu He Gin Lys Leu Gin Arg Leu Leu Arg Gin Lys Glu Arg Gin 

355 360 365 

Leu Ala Asp Gly Lys Arg Cys Met Gin Phe Val Glu Ala Ala Ala Gin 

370 375 380 

Glu Arg Glu Gin Gin Lys Glu Ala Ala Trp Lys His Asn Gin Glu Leu 
385 390 395 400 

Arg Lys Ala Leu Gin His Leu Gin Gly Glu Leu His Ser Lys Ser Gin 

405 410 415 

Gin Leu His Val Leu Glu Ala Glu Lys Tyr Asn Glu He Arg Thr Gin 

420 425 430 

Gly Gin Asn He Gin His Leu Ser His Ser Leu Ser His Lys Glu Gin 

435 440 445 

Leu He Gin Glu Leu Gin Glu Leu Leu Gin Tyr Arg Asp Thr Thr Asp 

450 455 460 

Lys Thr Leu Asp Thr Asn Glu Val Phe Leu Glu Lys Leu Arg Gin Arg 
465 470 475 480 

He Gin Asp Arg Ala Val Ala Leu Glu Arg Val He Asp Glu Lys Phe 

485 490 495 

Ser Ala Leu Glu Glu Lys Asp Lys Glu Leu Arg Gin Leu Arg Leu Ala 

500 505 510 

Val Arg Asp Arg Asp His Asp Leu Glu Arg Leu Arg Cys Val Leu Ser 

515 520 525 

Ala Asn Glu Ala Thr Met Gin Ser Met Glu Ser Leu Leu Arg Ala Arg 

330 535 540 

Gly Leu Glu Val Glu Gin Leu He Ala Thr Cys Gin Asn Leu Gin Trp 
345 550 555 560 

Leu Lys Glu Glu Leu Glu Thr Lys Phe Gly His Trp Gin Lys Glu Gin 

565 570 575 

Glu Ser He He Gin Gin Leu Gin Thr Ser Leu His Asp Arg Asn Lys 

580 585 590 

Glu Val Glu Asp Leu Ser Ala Thr Leu Leu His Lys Leu Gly Pro Gly 

595 600 605 

Gin Ser Glu Val Ala Glu Glu Leu Cys Gin Arg Leu Gin Arg Lys Glu 

610 615 620 

Arg Val Leu Gin Asp Leu Leu Ser Asp Arg Asn Lys Gin Ala Met Glu 
625 630 635 640 

His Glu Met Glu Val Gin Gly Leu Leu Gin Ser Met Gly Thr Arg Glu 

645 650 655 

Gin Glu Arg Gin Ala Val Ala Glu Lys Met Val Gin Ala Phe Met Glu 

660 665 670 

Arg Asn Ser Glu Leu Gin Ala Leu Arg Gin Tyr Leu Gly Gly Lys Glu 

675 680 685 

Leu Met Ala Ala Ser Gin Ala Phe He Ser Asn Gin Pro Ala Gly Ala 

690 695 700 

Thr Ser Val Gly Pro His His Gly Glu Gin Thr Asp Gin Gly Ser Thr 
705 710 715 720 

Gin Met Pro Ser Arg Asp Asp Ser Thr Ser Leu Thr Ala Arg Glu Glu 

725 730 735 

Ala Ser He Pro Arg Ser Thr Leu Gly Asp Ser Asp Thr Val Ala Gly 

740 745 750 

Leu Glu Lys Glu Leu Ser Asn Ala Lys Glu Glu Leu Glu Leu Met Ala 

755 760 . 765 

Lys Lys Glu Arg Glu Ser Gin He Glu Leu Ser Ala Leu Gin Ser Met 

770 775 780 

Met Ala Val Gin Glu Glu Glu Leu Gin Val Gin Ala Ala Asp Leu Glu 
785 790 795 800 

Ser Leu Thr Arg Asn He Gin He Lys Glu Asp Leu He Lys Asp Leu 
805 810 815 
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Gin 


Met 


Gin 


Leu 


Val 


Asp Pro Glu Asp 


Met 


Pro 


Ala 


Met Glu Arg Leu 








820 




825 








830 




Thr 


Gin 


Glu 


Val 


Leu 


Leu Leu Arg Glu 


Lys 


Val 


Ala 


Ser Val 


Glu Pro 






835 






840 








845 




Gin 


Gly 


Gin 


Glu 


Gly 


Ser Glu Asn Arg Arg 


Gin 


Gin 


Leu Leu 


Leu Met 




850 








855 






860 






Leu 


Glu 


Gly Leu 


Val 


Asp Glu Arg Ser Arg 


Leu 


Asn 


Glu Ala 


Leu Gin 


865 










870 




875 






830 


Ala 


Glu 


Arg 


Gin 


Leu 


Tyr Ser Ser Leu 


Val 


Lys 


Phe 


His Ala 


Gin Pro 










885 




890 








895 


Glu 


lie 


Ser 


Glu 


Arg 


Asp Arg Thr Leu 


Gin 


Val 


Glu 


Leu Glu 


Gly Ala 








900 




905 








910 


Gin 


Val 


Leu Arg 


Ser 


Arg Leu Glu Glu 


Val 


Leu 


Gly 


Arg Ser 


Leu Glu 






915 






920 








925 




Arg Leu 


Ser Arg 


Leu 


Glu Thr Leu Ala 


Ala 


He 


Gly 


Gly Ala 


Thr Ala 




930 








935 






940 




Gly 


;.sp 


Glu 


Thr 


Glu 


Asp Thr Ser Thr 


Gin 


Phe 


Thr 


Asp Ser 


He Glu 


945 










950 




955 




960 


Glu 


Glu 


Ala 


Ala 


His 


Asn Ser His Gin 


Gin 


Leu 


He 


Lys Val 


Ser Leu 










965 




970 






975 


Glu 


Lys 


Ser 


Leu 


Thr 


Thr Met Glu Thr 


Gin 


Asn 


Thr 


Cys Leu 


Gin Pro 








980 




985 








990 




Pro 


Ser 


Pro 


Val 


Gly 


Glu Asp Gly Asn Arg 


His 


Leu 


Gin Glu 


Glu Met 






995 






1000 








1005 




Leu 


His 


Leu Arg 


Ala 


Glu He His Gin 


Pro 


Leu 


Glu 


Glu Lys Arg Lys 




1010 






1015 






1020 




Ala 


Glu 


Ala 


Glu 


Leu 


Lys Glu Leu Lys 


Ala 


Gin 


He 


Glu Glu 


Ala Gly 


1025 








1030 




1035 




1040 


Phe 


Ser 


Ser 


Val 


Ser 


His He Arg Asn Thr 


Met 


Leu 


Ser Leu 


Cys Leu 










1045 


1050 






1055 


Cys 


Leu 


Glu 


Asn 


Ala 


Glu Leu Lys Glu Gin Met Gly Glu Ala Met Ser 








1060 


1065 






1070 


Asp 


Gly Trp 


Glu 


Val 


Glu Glu Asp Lys 


Glu 


Lys 


Gly Glu Val Met Val 






1075 




1080 








1085 




Glu 


Thr 


Val 


Val 


Ala 


Lys Gly Gly Leu Ser Glu Asp Ser Leu Gin Ala 




1090 






1095 






1100 




Glu Phe Arg Lys 


Val 


Gin Gly Arg Leu Lys 


Ser 


Ala 


Tyr Asn 


He He 


1105 








1110 




1115 


1120 


Asn 


Leu 


Leu 


Lys 


Glu 


Gin Leu Val Leu Arg 


Ser 


Ser 


Glu Gly Asn Thr 










1125 


1130 






1135 


Lys 


Glu 


Met 


Pro 


Glu 


Phe Leu Val Arg Leu Ala Arg 


Glu Val Asp Arg 








1140 


1145 






1150 


Met 


Asn 


Met 


Gly 


Leu 


Pro Ser Ser Glu 


Lys 


His 


Gin 


His Gin 


Glu Gin 



1155 1160 1165 



Glu Asn Met Thr Ala Arg Pro Gly Pro Arg Pro Gin Ser Leu Lys Leu 

1170 1175 1180 

Gly Thr Ala Leu Ser Val Asp Gly Tyr Gin Leu Glu Asn Lys Ser Gin 
1185 1190 1195 1200 

Ala Gin Asp Ser Gly His Gin Pro Glu Phe Ser Leu Pro Gly Ser Thr 

1205 1210 1215 

Lys His Leu Arg Ser Gin Leu Ala Gin Cys Arg Gin Arg Tyr Gin Asp 

1220 1225 1230 

Leu Gin Glu Lys Leu Leu He Ser Glu Ala Thr Val Phe Ala Gin Ala 

1235 1240 1245 

Asn Gin Leu Glu Lys Tyr Arg Ala He Leu Ser Glu Ser Leu Val Lys 

1250 1255 1260 

Gin Asp Ser Lys Gin He Gin Val Asp Leu Gin Asp Leu Gly Tyr Glu 
1265 1270 1275 1280 

Thr Cys Gly Arg Ser Glu Asn Glu Ala Glu Arg Glu Glu Thr Thr Ser 

1285 1290 1295 

Pro Glu Cys Glu Glu His Gly Asn Leu Lys Pro Val Val Leu Val Glu 

1300 1305 1310 

Gly Leu Cys Ser Glu Gin Gly Tyr Leu Asp Pro Val Leu Val Ser Ser 
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1315 1320 1325 

Pro Val Lys Asn Pro Trp Arg Thr Ser Gin Glu Ala Arg Arg He Gin 

1330 1335 1340 

Ala Gin Gly Thr Ser Asp Asn Ser Ser Leu Leu Arg Lys Asp He Ara 
1345 1350 1355 1350 

Asn Leu Lys Ala Gin Leu Pro Asn Ala Tyr Lys Val Leu Gin Asn Leu 

1365 1370 1375 

Arg Ser Arg Val Arg Ser Leu Ser Ala Thr Ser Asp Tyr Ser Ser Ser 

1380 1385 1390 

Leu Glu Arg Pro Arg Lys Leu He Ala Val Ala Thr Leu Glu Gly Ala 

1395 1400 1405 

Ser Pro His Ser Val Thr Asp Glu Asp Glu Gly Leu Leu Ser Asp Glv 

1410 1415 1420 

Thr Gly Ala Phe Tyr Pro Pro Gly Leu Gin Ala Lys Lys Asn Leu Glu 
1425 1430 1435 1440 

Asn Leu He Gin Arg Val Ser Gin Leu Glu Ala Gin Leu Pro Lys Thr 

1445 1450 1455 

Gly Leu Glu Gly Lys Leu Ala Glu Glu Leu Lys Ser Ala Ser Trp Pro 

1460 1465 1470 

Gly Lys Tyr Asp Ser Leu He Gin Asp Gin Ala Arg Lys Thr Val He 

1475 1480 1485 

Ser Ala Ser Glu Asn Thr Lys Arg Glu Lys Asp Leu Phe Ser Ser His 

1490 1495 1500 

Pro Thr Phe Glu Arg Tyr Val Lys Ser Phe Glu Asp Leu Leu Arg Asn 
1S05 1510 1515 1520 

Asn Asp Leu Thr Thr Tyr Leu Gly Gin Ser Phe Arg Glu Gin Leu Ser 

1525 1530 1535 

Ser Arg Arg Ser Val Thr Asp Arg Leu Thr Ser Lys Phe Ser Thr Lys 

1540 1545 1550 

Asp His Lys Ser Glu Lys Glu Glu Val Gly Leu Glu Pro Leu Ala Phe 

1555 1560 1565 

Arg Phe Ser Arg Glu Leu Gin Glu Lys Glu Lys Val He Glu Val Leu 

1570 1575 1580 

Gin Ala Lys Val Asp Thr Arg Phe Phe Ser Pro Pro Ser Ser His Ala 
1585 1590 1595 I6OO 

Ala Ser Glu Ser His Arg Cys Ala Ser Ser Thr Ser Phe Leu Ser Asp 

1605 1610 1615 

Asp He Glu Ala Cys Ser Asp Met Asp Val Ala Ser Glu Tyr Thr His 

1620 1625 1630 

Tyr Glu Glu Lys Lys Pro Ser Pro Ser Asn Ser Ala Ala Ser Ala Ser 

1635 1640 1645 

Gin Gly Leu Lys Gly Glu Pro Arg Ser Ser Ser He Ser Leu Pro Thr 

1650 1655 1660 

Pro Gin Asn Pro Pro Lys Glu Ala Ser Gin Ala Gin Pro Gly Phe His 
1665 1670 1675 1680 

Phe Asn Ser He Pro Lys Pro Ala Ser Leu Ser Gin Ala Pro Met His 

1685 1690 1695 

Phe Thr Val Pro Ser Phe Met Pro Phe Gly Pro Ser Gly Pro Pro Leu 

1700 1705 1710 

Leu Gly Cys Cys Glu Thr Pro Val Val Ser Leu Ala Glu Ala Gin Gin 

1715 1720 1725 

Glu Leu Gin Met Leu Gin Lys Gin Leu Gly Arg Ser Val Ser He Ala 

1730 1735 1740 

Pro Pro Thr Ser Thr Ser Thr Leu Leu Ser Asn His Thr Glu Ala Ser 
1745 1750 1755 1760 

Ser Pro Arg Tyr Ser Asn Pro Ala Gin Pro His Ser Pro Ala Arg Gly 

1765 . 1770 1775 

Thr He Glu Leu Gly Arg He Leu Glu Pro Gly Tyr Leu Gly Ser Gly 

1780 1785 1790 

Gin Trp Asp Met Met Arg Pro Gin Lys Gly Ser He Ser Gly Glu Leu 

1795 1800 1805 

Ser Ser Gly Ser Ser Met Tyr Gin Leu Asn Ser Lys Pro Thr Gly Ala 
1810 1815 1820 
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Asp Leu Leu Glu Glu His Leu Gly Glu He Arg Asn Leu Arg Gin Ara 
1825 1830 1835 1840 

Leu Glu Glu Ser He Cys Val Asn Asp Arg Leu Arg Glu Gin Leu Gin 

1845 1850 1855 

His Arg Leu Ser Ser Thr Ala Arg Glu Asn Gly Ser Thr Ser His Phe 

I860 1865 1870 

Tyr Ser Gin Gly Leu Glu Ser Met Pro Gin Leu Tyr Asn Glu Asn Ara 

1875 1880 1885 

Ala Leu Arg Glu Glu Asn Gin Sec Leu Gin Thr Arg Leu Ser His Ala 

1890 1895 1900 

Ser Arg Gly His Ser Gin Glu Val Asp His Leu Arg Glu Ala Leu Leu 
1905 1910 1915 1920 

Ser Ser Ser Ser Gin Leu Gin Glu Leu Glu Lys Glu Leu Glu Gin Gin 

1925 1930 1935 

Lys Ala Glu Arg Arg Gin Leu Leu Glu Asp Leu Gin Glu Lys Gin Asp 

1940 1945 1950 

Glu He Val His Phe Arg Glu Glu Arg Leu Ser Leu Gin Glu J^.sn Asn 

1955 1960 1965 

Ser Arg Leu Gin His Lys Leu Ala Leu Leu Gin Gin Gin Cys Glu Glu 

1970 1975 1980 

Lys Gin Gin Leu Ser Leu Ser Leu Gin Ser Glu Leu Gin He Tyr Glu 
1985 1990 1995 2000 

Ser Leu Tyr Glu Asn Pro Lys Lys Gly Leu Lys Ala Phe Ser Leu Asp 

2005 2010 2015 

Ser Cys Tyr Gin Val Pro Gly Glu Leu Ser Cys Leu Val Ala Glu He 

2020 2025 2030 

Arg Ala Leu Arg Val Gin Leu Glu Gin Ser He Gin Val Asn Asn Arg 

2035 2040 2045 

Leu Arg Leu Gin Leu Glu Gin Gin Met Asp His Gly Ala Gly Lys Ala 

2050 2055 2060 

Ser Leu Ser Ser Cys Pro Val Asn Gin Ser Phe Ser Ala Lys Ala Glu 
2065 2070 2075 2080 

Leu Ala Asn Gin Gin Pro Pro Phe Gin Gly Ser Ala Ala Ser Pro Pro 

2085 2090 2095 

Val Arg Asp Val Gly Leu Asn Ser Pro Pro Val Val Leu Pro Ser Asn 

2100 2105 2110 

Ser Cys Ser Val Pro Gly Ser Asp Ser Ala He He Ser Arg Thr Asn 

2115 2120 2125 

Asn Gly Ser Asp Glu Ser Ala Ala Thr Lys Thr Pro Pro Lys Met Glu 

2130 2135 2140 

Val Asp Ala Ala Asp Gly Pro Phe Ala Ser Gly His Gly Arg His Val 
2145 2150 2155 2160 

He Gly His Val Asp Asp Tyr Asp Ala Leu Gin Gin Gin He Gly Glu 

2165 2170 2175 

Gly Lys Leu Leu He Gin Lys He Leu Ser Leu Thr Arg Pro Ala Arg 

2180 2185 2190 

Ser Val Pro Ala Leu Asp Ala Gin Gly Thr Glu Ala Pro Gly Thr Lys 

2195 2200 2205 

Ser Val His Glu Leu Arg Ser Ser Ala Arg Ala Leu Asn His Ser Leu 

2210 2215 2220 

Glu Glu Ser Ala Ser Leu Leu Thr Met Phe Trp Arg Ala Ala Leu Pro 
2225 2230 2235 2240 

Asn Ser His Gly Ser Val Leu Val Gly Glu Glu Gly Asn Leu Met Glu 

2245 2250 2255 

Lys Glu Leu Leu Asp Leu Arg Ala Gin Val Ser Gin Gin Gin Gin Leu 

2260 2265 2270 

Leu Gin Ser Thr Ala Val Arg Leu Lys Thr Ala Asn Gin Arg Lys Lys 

2275 2280 2285 

Ser Met Glu Gin Phe He Val Ser His Leu Thr Arg Thr His Asp Val 

2290 2295 2300 

Leu Lys Lys Ala Arg Thr Asn Leu Glu Met Lys Ser Phe Arg Ala Leu 
2305 2310 2315 2320 

Met Cys Thr Pro Ala Leu 
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2325 

<210> 2 
<211> 9679 
<212> DNA 
<213> rat 

<220> 

<221> misc_f eature 
<222> (1) . . . (9679) 
<223> n = A,T,C or G 

<400> 2 

ccggtcccct ttggtagtag tatctcagag ctcgccccat agtttcatag ttcatgtctg 60 

gtttgttctt atgctttccc cagagcttcg agacagcctt tgagtccacc agcttgaata 120 

tgcccttttc tctctgagtc catttaatat acctgggaca agtattttta tcttgaagca 180 

gatctaaaag aaactcccac agataggttg tgtttccttt tccttctctg gctttcttct 240 

tgactcctaa ctcaggagac ccattggaaa ctggtgactg ctgggtcttt ggtttacggc 300 

caactttctt ctttttcatt ggttcgtggc tgtctggtga agtatggata ggcgaggcat 360 

ccattggttc agactcctct gttgacacct ccactacagt ctccgtaatg acatctggcc 420 

tcatcgcagc atggataaaa tcggattctt gaatcctcaa gcaggtagga gactccatat 4 80 

gaagcagggc ttcagcagct tcaatggtct tatccgtaca gtgtgcatta ctgctgtgaa 540 

ctgatgcttc cacggcccgg atgagcagat ccagctggtt cgtgggtccc tcatgcagag 600 

acgtcgccat gtttatcccg gggctggaac tgctgcttca cattgactta caccctgagc 660 

agcggcgaca ggggagaagg cggaacccgc ggccggagac acacgccgtg cgggcggcac 720 

acactcacgc actcgcacac actccgacgc ccggatcctt gcgcgtcctc cgacaggaag 780 

cggcggccgg cccgcctccc gccgccgggc tgagcagccc caccacctaa cggcaggggc 840 

ggcggcggcc ccggctggca acgcgatcct tccgccccgc gcccagacag gaagtcccgg 900 

gcgccggcag ccagcggccg cacggacacc tgaggctggg gagcccgcag gccgccctcg 960 

gggacgcggg cctcggcagg aaaaggcgcg cttcacgttc tgcggaagcg aagtctgcaa 1020 

atgtcccctc agcatggtct tcctcctggc tcaatctgtc tcaccttcag gtgatcctag 1080 

gactggggct cctttccagg tccccagttt ctcaagtcga tcttctacct ccctcttgat 1140 

tttctactcc attgctggaa agctccagaa cagagcctcc gccgccaacc actgctgatg 1200 

ccatcgcgtc ttccctgagc aagtttcgaa cgctgcgaat caatgtaatt acggctcaga 1260 

tgattgccag ggttatcggt ttcatgttct aattcaatag tgatggagta gacatccaga 1320 

agtccagtct tctaaagatg attaaccaga gggtagtttg acggttaagt agtctaagca 1380 

tccttcaccg tttccacact cccaagagct gaactctaaa ccagcagctc tctggagcta 1440 

ctgctctccc tccacgtcgc cgtgtccctt gcccttcccc tcagggccgc agaccggccg 1500 

agccgccgca gccgcccgcc gttggcccgg cgtcctgcgg gaagccgagg gggcctcccc 1560 

gggccaccgc gcgagccgct ccgcaccaca ggacgagaca aaccgcggct atgtcgcctt 1620 

agccctcggg gtcccacagc ctcagcagcg tcctagcctg cccgctccat gccacggcaa 1680 

ggctgcaccg tgttccaggg gtgaaggggg cgatcgggca tgctcctccc catgggtcgc 1740 

ccaccatgtc taatggatat cgcactctgt cccagcacct caatgacctg aagaaggaga 1800 

acttcagcct caagctgcgc atctacttcc tggaggagcg catgcaacag aagtatgaag 1860 

tcagccggga ggacgtctac aagcggaaca ttgagctgaa ggttgaagtg gagagcctga 1920 

aacgagagct ccaggacagg aaacagcatc tacataaaac atgggccgat gaggaggatc 1980 

tcaacagcca gaatgaagca gagctccggc gccaggttga agaaccgcag caggagacag 2040 

aacacgttta tgagctccta gacaacaaca ttcagctgct gcaggaggaa tccaggtttg 2100 

caaaggatga agccacacag atggagactc tggtggaggc agagaagggg tgtaatctgg 2160 

agctctcaga gaggtggaag gatgctacca agaacaggga agatgcaccg ggagaccagg 2220 

tgaagcttga ccaatattct gcggcactgg ctcagaggga caggagaatt gaagagctga 2280 

ggcagagctt ggctgcccag gaggggcttg tggaacagct gtctcgagag aaacaacaac 2340 

tgttacatct gctggaggag cctgggggca tggaagtgca gcccatgcct aaagggttac 2400 

ccacgcaaca aaagccagac ctaaatgaga cccctacaac ccagccatct gtgtctgatt 2460 

cccacctggc agaactccag gacaaaatcc agcaaacaga ggtcaccaac aagattcttc 2520 

aagagaaact gaatgacatg agctgtgagc tcagatctgc acaggagtcg tctcagaagc 2580 

aagatacgac aatccaaagc ctcaaggaaa tjgctaaagag cagggaaagt gagactgaag 2640 

agctgtacca ggtgattgaa ggtcaaaatg acacaatggc aaagcttccg gaaatgctac 2700 

accagagcca gctcggacag ctccagagct cagagggcat tgcccctgct cagcagcaag 2760 

tggccctgct tgaccttcag agtgctctgt tctgcagcca gcttgaaatc cagaagctcc 2820 

agaggctgtt acgccagaaa gagcgtcagc tggctgacgg caagcggtgc atgcaatttg 2880 

tggaggctgc agcacaggag agagagcagc agaaggaagc tgcttggaaa cataaccagg 2940 

aattacgaaa agctttgcaa cacctccaag gagaactgca cagtaagagc caacagctcc 3000 
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acgttctgga ggcagaaaaa tataatgaaa ttcgaaccca gggacaaaac attcaacacc 3060 
taagtcacag tctgagtcac aaagagcagc taattcagga acttcaggag ctcctacagt 3120 
atcgggatac cacagacaaa actctagaca caaatgaggt gtttcttgag aaactacggc 3X80 
aacgaataca agaccgggca gttgctctag agcgggttat agatgaaaag ttctctgctc 3240 
tagaagaaaa ggacaaggaa ctgcggcagc tccggcttgc tgtgagggac cgagaccatg 3300 
acttagagag actgcgttgt gtcctgtctg ccaatgaagc taccatgcaa agtatggaga 3360 
gtctcctgag ggccagaggc ctggaagtgg agcagttaat tgccacctgc caaaacctcc 3420 
agtggttgaa ggaagaattg gaaaccaagt ttggccactg gcagaaggaa caggagagca 3480 
tcattcagca gttacagaca tctctgcatg acaggaacaa agaagtagag gatctcagtg 3540 
caactttgct ccacaaactt ggacccggcc agagtgaagt agctgaggag ctgtgccagc 3600 
gcctgcagcg gaaggaaagg gtgctgcagg accttctgag tgatcggaac aaacaagcca 3660 
tggagcacga gatggaggtc cagggactgc tccagtcgat gggcacccgg gaacaggaaa 3720 
gacaggctgt tgcagaaaaa atggtacaag ccttcatgga aagaaactcg gaattacagg 3780 
ccctgcggca gtatctaggg gggaaggaat taatggcagc atctcaggca ttcatctcta 3840 
accaaccagc tggagcgact tctgtaggcc cccaccatgg agagcaaact gaccaaggtt 3900 
ctacgcagat gccctctcga gacgacagca cctcgctgac tgccagagag gaggccagca 3960 
taccccggtc tacattagga gactcagaca cagttgcaog gctggagaaa gaactgagca 4020 
atgccaagga ggagcttgag ctcatggcca aaaaagaaag agaaagccag atagaattgt 4080 
ctgccctgca gtccatgatg gctgtgcaag aggaagagct gcaggtgcag gctgctgact 4140 
tggagtccct gaccaggaac atacagataa aagaagacct cataaaggac ctgcaaatgc 4200 
aactggttga ccctgaagat atgccagcca tggagcgcct gacccaagag gtcttacttc 4260 
ttcgggaaaa agttgcttca gtggaacccc agggtcagga agggtcagag aacaggagac 4320 
aacagttgct gctgatgtta gaaggactag tggatgaacg gagtcggctc aacgaggccc 4380 
tgcaagctga gcggcagctc tacagcagcc tggtcaagtt ccatgcccaa ccagagatct 4440 

ctgagagaga ccgaactctg caggtggaac tggaaggggc ccaggtgtta cgcagtcgac 4500 

tagaagaagt tcttggaaga agcctggagc gcttaagcag gctggagacc ctggccgcca 4560 

ttggaggtgc tactgcaggc gatgagactg aagatacaag cacacagttc acagacagca 4620 

ttgaggagga ggctgcacac aacagccacc agcaactcat caaggtgtct ttggagaaaa 4680 

gcctgaccac catggagacc cagaacacat gtcttcagcc cccttcccca gtaggagagg 4740 

atggtaacag gcatcttcag gaagaaatgc tccacctgag ggctgaaatc caccagccct 4800 

tagaagagaa gagaaaagct gaggcagaac tcaaggagct aaaggctcaa attgaggaag 4860 

caggattctc ctctgtgtcc cacatcagga acaccatgct gagcctttgc ctttgccttg 4920 

agaatgcaga gctgaaagag cagatgggag aagcaatgtc tgatggatgg gaggtggagg 4980 

aagacaagga gaagggcgag gtgatggtgg agaccgtggt ggccaaaggg ggtctgagtg 5040 

aggacagcct tcaggctgag ttcaggaaag tccaggggag actcaagagt gcctacaaca 5100 

tcatcaacct cctcaaagag cagctggtcc tgagaagctc ggaagggaac actaaggaga 5160 

tgccagagtt cctcgtgcgc ctggccaggg aggtggacag aatgaacatg ggcttgcctt 5220 

cctcggagaa gcatcaacac caagaacagg agaatatgac cgcaaggcct ggccccaggc 5280 

cccagagtct caagcttggg acagctctct cagtagacgg ctaccaactg gagaacaagt 5340 

cccaggccca agactctgga catcagccag aatttagcct accagggtcc accaaacacc 5400 

tgcgctccca gctggctcag tgtagacaac ggtaccaaga tctccaggag aagctgctca 5460 

tctcagaagc cactgtgttt gcccaggcaa accagctaga gaagtacaga gccatattaa 5520 

gtgaatccct ggtgaagcag gacagcaagc agatccaggt ggaccttcag gacctgggct 5580 

atgagacttg tggccgaagt gagaatgaag ctgaacgtga ggagaccacc agccctgagt 5640 

gtgaggagca cggtaacctg aagcctgtgg tgctggtgga aggcttgtgc tctgagcaag 5700 

ggtacctgga ccctgtcttg gtcagctcac ctgtgaagaa cccttggaga acaagccagg 5760 

aagccagaag aatccaggca caaggaactt cagacaacag ctctctcctg aggaaggaca 5820 

tccgaaatct gaaagcccag ctaccgaatg cctacaaggt ccttcagaac ctgaggagcc 5880 

gggtccggtc cctgtctgcc acaagcgatt actcatcgag tctggagaga ccccgcaagc 5940 

tgatagccgt ggcaaccctt gagggggcct caccccacag tgtcactgat gaagacgaag 6000 

gcttgttgtc agatggcacc ggggcttttt accctccagg gctccaggcc aaaaagaatc 6060 

tagagaatct catccagaga gtatcccagc tggaggccca gctccccaaa actggactag 6120 

aagggaagct ggctgaagaa ctgaagtccg cctcgtggcc tggaaaatac gattctttga 6180 

ttcaggatca ggcccgaaaa actgtcatat ctgcgtccga aaatacnaaa agggagaagg 6240 

atttgttttc ttctcaccca acattcgaaa gatacgtcaa atcttttgaa gacctcctga 6300 

ggaacaacga cttgactact tacctgggcc agagcttccg ggaacaactt agttcaaggc 6360 

gttcagtgac agacaggctg accagcaaat tcagcacaaa ggatcataag agtgaaaaag 6420 

aagaagttgg gcttgagcca ctggccttca ggttcagcag ggaattacag gagaaagaga 6480 

aagtgattga agtcctgcag gccaaggtgg atacccggtt tttctcaccc cccagcagcc 6540 

atgctgcgtc tgagtcccac cgttgtgcca gcagcacatc tttcctgtcg gatgacatag 6600 

aagcctgctc tgacatggac gtagccagcg agtacacaca ctatgaagag aagaagccct 6660 

cacccagtaa ctcagcagcc agtgcatctc aggggcttaa gggcgagccc agaagcagct 6720 

ccatcagctt gccaactccc cagaaccccc ctaaggaggc cagccaggcc cagccaggct 6780 
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ttcactttaa ctccataccc aagccggcta gcctttccca ggcaccaatg cacttcactg 6840 

tacccagctt catgcctttc ggcccctctg ggcctcccct tcttggttgc tgtgagacac 6900 

cagtggtgtc cttggctgag gctcaacaag agctgcagat gctgcagaag cagctgggac 6960 

gaagtgttag cattgcccct cccacctcca catccacgtt gcttagcaac cacacagaag 7020 

ctagctctcc ccgctacagc aaccctgctc agccccactc cccagcaagg ggcaccatag 7080 

agctgggcag aatcctggag cctggatacc tgggcagcgg ccagtgggac atgatgaggc 7140 

ctcagaaagg gagcatctct ggggagctgt cctcaggctc ctcgatgtac cagcttaact 7200 

ccaaacccac aggggccgac ctgttggaag agcatttagg tgagatccgg aacctgcgcc 7260 

agcgcctgga ggagtccata tgtgtcaatg acaggctacg ggagcagctg cagcataggc 7320 

tcagctccac ggcccgagaa aatggttcca cctctcactt ctacagtcag ggcctggagt 7380 

ccatgcctca gctctacaat gagaacagag ccctcaggga agaaaaccaa agcctgcaga 7440 

cacggctcag tcatgcttcc aggggacact cccaggaagt ggaccacctg agggaggctc 7500 

tgctttcctc aagttcccag ctccaggagc tggagaagga gctggagcag cagaaggctg 7560 

agcggcggca gcttctggaa gacttgcagg agaagcagga tgagatcgtg catttccgag 7620 

aggagaggct gtccctccag gaaaacaact ccaggctgca gcacaagctg gccctcctgc 7680 

aacaacagtg tgaggagaaa cagcagctct ccctgtccct gcagtcagag ctccagatct 7740 

acgagtccct ctacgaaaat cctaagaagg gcttgaaagc cttcagccta gattcctgtt 7800 

accaagtccc gggtgagttg agctgcctgg tggcagagat tcgagctctg agagtgcagt 7860 

tggagcagag cattcaagtg aacaaccgtc tgcggctgca gctggaacag cagatggatc 7920 

acggtgctgg caaagccagt ctcagttcct gccctgttaa ccagagcttc tcagccaagg 7980 

cggagctggc aaaccagcag ccacccttcc aaggttcagc tgcttcccct ccagtccggg 8040 

acgttggctt gaattctcca cccgtggtcc tccccagcaa ttcgtgctct gttcctggct 8100 

cagactctgc catcatcagt aggacaaaca atggttcgga tgagtctgca gcaacgaaga 8160 

cccctcccaa gatggaggtc gatgctgctg atggcccatt tgccagtgga cacggcagac 8220 

acgtcatcgg ccatgtggat gactacgacg ccctacagca gcagattggg gaagggaagc 8280 

tgctgatcca aaagatactg tctctcacga ggccagcacg cagcgtccct gcactggacg 8340 

cgcagggcac agaggcacca ggtaccaaaa gtgtccatga gcttcggagc agcgccaggg 8400 

ctctgaacca cagcctagaa gagtcagctt ccctcctcac catgttctgg agagcagctt 8460 

tgccaaactc tcatggttct gtactggtag gcgaagaggg aaacctgatg gagaaagaac 8520 

tcctagacct gcgagcccaa gtgtcccaac agcaacagct ccttcagagc actgctgtgc 8580 

gtctgaagac ggccaaccag aggaagaaaa gcatggagca gttcatcgtg agccatctga 8640 

ccaggaccca tgatgtcttg aagaaagcac ggactaattt agagatgaaa tccttcaggg 8700 

ccctgatgtg cactccagcc ttgtgaccct tgccttccag gagccacata aaaggcgaag 8760 

ccaggagtcc ttaaaacagc aggaagggtg ggcctgcccg cccctagtac agctgcctgt 8820 

ctgctgagga atacctggtc cgactcctcc cctgctggag ctccagggaa gggctcatat 8880 

atgtgtccac atgggacagg caggaaggaa agtggcatcc tgacaatgaa tatgattagc 8940 

caaggcccac tgggcccatc actaagcaaa actcatgtag actgtgtaga aggccccccg 9000 

gcactgcttc tagacagcct cagcagcacg gtgcccacct cgttacagtt ctcacctcaa 9060 

gatagccaac tcaggggaac taggacctta ccacccacaa acaggatgtg tggtcccaat 9120 

gccaacgctc ctcagacagt tgtaaaagca cacatcattg agtggcagcg' tccagccgga 9180 

cactgttgga gactaccaaa cccctcactg acccagtctt gggccaggcc agctctgtgg 9240 

gccaagtctg gtagtacttt ggtctctacc accacaccag agagagtcta- tatagcaaat 9300 

gtggtaactt gtaggtgccc tgcacttagc ctagcacctt ctgtttctta cgtgatctca 9360 

agttgaacca acttccttaa ctctgctgtc ccctgaatcc taacttccct caggggaatt 9420 

ggagattggt ggccacatca tgcctattga atgtttagtg aacagcatat cggtgcctct 948a 

taatggcatg ggcaaggcct gctctgtact gaagactgtg tcttcacagt gctcatagga 9540 

cgtgggtgtg tgtataaatg tataatatag atttatatat gtcgctatgg ctatgtgttg 9600 

aaggccagca taagtgcaga gcgatgggtg agaagacgct aagcagtctt tcttatggct 9660 

attaaagcta actgtgtac 9679 



<210> 3 
<211> 6981 
<212> DNA 
<213> rat 

<220> 

<221> misc_feature 
<222> (11. (6981) 
<223> n = A,T,C or G 

<400> 3 



-8- 



wo 00/27861 



PCT/US99/26860 



atgtctaatg gatatcgcac tctgtcccag cacctcaatg acctgaagaa ggagaacttc 60 

agcctcaagc tgcgcatcta cttcctggag gagcgcatgc aacagaagta tgaagtcagc 120 

cgggaggacg tctacaagcg gaacattgag ctgaaggttg aagtggagag cctgaaacga 180 

gagctccagg acaggaaaca gcatctacat aaaacatggg ccgatgagga ggatctcaac 240 

agccagaatg aagcagagct ccggcgccag gttgaagaac cgcagcagga gacagaacac 300 

gtttatgagc tcctagacaa caacattcag ctgctgcagg aggaatccag gtttgcaaag 360 

gatgaagcca cacagatgga gactctggtg gaggcagaga aggggtgtaa tctggagctc 420 

tcagagaggt ggaaggatgc taccaagaac agggaagatg caccgggaga ccaggtgaag 480 

cttgaccaat attctgcggc actggctcag agggacagga gaattgaaga gctgaggcag 540 

agcttggctg cccaggaggg gcttgtggaa cagctgtctc gagagaaaca acaactgtta 600 

catctgctgg aggagcctgg gggcatggaa gtgcagccca tgcctaaagg gttacccacg 660 

caacaaaagc cagacctaaa tgagacccct acaacccagc catctgtgtc tgattcccac 720 

ctggcagaac tccaggacaa aatccagcaa acagaggtca ccaacaagat tcttcaagag 780 

aaactgaatg acatgagctg tgagctcaga tctgcacagg agtcgtctca gaagcaagat 840 

acgacaatcc aaagcctcaa ggaaatgcta aagagcaggg aaagtgagac tgaagagctg 900 

taccaggtga ttgaaggtca aaatgacaca atggcaaagc ttccggaaat gctacaccag 960 

agccagctcg gacagctcca gagctcagag ggcattgccc ctgctcagca gcaagtggcc 1020 

ctgcttgacc ttcagagtgc tctgttctgc agccagcttg aaatccagaa gctccagagg 1080 

ctgttacgcc agaaagagcg tcagctggct gacggcaagc ggtgcatgca atttgtggag 1140 

gctgcagcac aggagagaga gcagcagaag gaagctgctt ggaaacataa ccaggaatta 1200 

cgaaaagctt tgcaacacct ccaaggagaa ctgcacagta agagccaaca gctccacgtt 1260 

ctggaggcag aaaaatataa tgaaattcga acccagggac aaaacattca acacctaagt 1320 

cacagtctga gtcacaaaga gcagctaatt caggaacttc aggagctcct acagtatcgg 1380 

gataccacag acaaaactct agacacaaat gaggtgtttc ttgagaaact acggcaacga 1440 

atacaagacc gggcagttgc tctagagcgg gttatagatg aaaagttctc tgctctagaa 1500 

gaaaaggaca aggaactgcg gcagctccgg cttgctgtga gggaccgaga ccatgactta 1560 

gagagactgc gttgtgtcct gtctgccaat gaagctacca tgcaaagtat ggagagtctc 1620 

ctgagggcca gaggcctgga agtggagcag ttaattgcca cctgccaaaa cctccagtgg 1680 

ttgaaggaag aattggaaac caagtttggc cactggcaga aggaacagga gagcatcatt 1740 

cagcagttac agacatctct gcatgacagg aacaaagaag tagaggatct cagtgcaact 1800 

ttgctccaca aacttggacc cggccagagt gaagtagctg aggagctgtg ccagcgcctg I860 

cagcggaagg aaagggtgct gcaggacctt ctgagtgatc ggaacaaaca agccatggag 1920 

cacgagatgg aggtccaggg actgctccag tcgatgggca cccgggaaca ggaaagacag 1980 

gctgttgcag aaaaaatggt acaagccttc atggaaagaa actcggaatt acaggccctg 2040 

cggcagtatc taggggggaa ggaattaatg gcagcatctc aggcattcat ctctaaccaa 2100 

ccagctggag cgacttctgt aggcccccac catggagagc aaactgacca aggttctacg 2160 

cagatgccct ctcgagacga cagcacctcg ctgactgcca gagaggaggc cagcataccc 2220 

cggtctacat taggagactc agacacagtt gcagggctgg agaaagaact gagcaatgcc 2280 

aaggaggagc ttgagctcat ggccaaaaaa gaaagagaaa gccagataga attgtctgcc 2340 

ctgcagtcca tgatggctgt gcaagaggaa gagctgcagg tgcaggctgc tgacttggag 2400 

tccctgacca ggaacataca gataaaagaa gacctcataa aggacctgca aatgcaactg 2460 

gttgaccctg aagatatgcc agccatggag cgcctgaccc aagaggtctt acttcttcgg 2520 

gaaaaagttg cttcagtgga accccagggt caggaagggt cagagaacag gagacaacag 2580 

ttgctgctga tgttagaagg actagtggat gaacggagtc ggctcaacga ggccctgcaa 2640 

gctgagcggc agctctacag cagcctggtc aagttccatg cccaaccaga gatctctgag 2700 

agagaccgaa ctctgcaggt ggaactggaa ggggcccagg tgttacgcag tcgactagaa 2760 

gaagttcttg gaagaagcct ggagcgctta agcaggctgg agaccctggc cgccattgga 2820 

ggtgctactg caggcgatga gactgaagat acaagcacac agttcacag^ cagcattgag 2880 

gaggaggctg cacacaacag ccaccagcaa ctcatcaagg tgtctttgga gaaaagcctg 2940 

accaccatgg agacccagaa cacatgtctt cagccccctt ccccagtagg agaggatggt 3000 

aacaggcatc ttcaggaaga aatgctccac ctgagggctg aaatccacca gcccttagaa 3060 

gagaagagaa aagctgaggc agaactcaag gagctaaagg ctcaaattga ggaagcagga 3120 
ttctcctctg tgtcccacat caggaacacc atgctgagcc tttgcctttg ccttgagaat * 3180 

gcagagctga aagagcagat gggagaagca atgtctgatg gatgggaggt ggaggaagac 3240 

aaggagaagg gcgaggtgat ggtggagacc gtggtggcca aagggggtct gagtgaggac 3300 

agccttcagg ctgagttcag gaaagtccag gggagactca agagtgccta caacatcatc 3360 

aacctcctca aagagcagct ggtcctgaga ^gctcggaag ggaacactaa ggagatgcca 3420 

gagttcctcg tgcgcctggc cagggaggtg gacagaatga acatgggctt gccttcctcg 3480 

gagaagcatc aacaccaaga acaggagaat atgaccgcaa ggcctggccc caggccccag 3540 

agtctcaagc ttgggacagc tctctcagta gacggctacc aactggagaa caagtcccag 3600 

gcccaagact ctggacatca gccagaattt agcctaccag ggtccaccaa acacctgcgc 3660 

tcccagctgg ctcagtgtag acaacggtac caagatctcc aggagaagct gctcatctca 3720 

gaagccactg tgtttgccca ggcaaaccag ctagagaagt acagagccat attaagtgaa 3780 
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tccctggtga 
acttgtggcc 
gagcacggta 
ctggaccctg 
agaagaatcc 
aatctgaaag 
cggtccctgt 
gccgtggcaa 
ttgtcagatg 
aatctcatcc 
aagctggctg 
gatcaggccc 
ttttcttctc 
aacgacttga 
gtgacagaca 
gttgggcttg 
attgaagtcc 
gcgtctgagt 
tgctctgaca 
agtaactcag 
agcttgccaa 
tttaactcca 
agcttcatgc 
gtgtccttgg 
gttagcattg 
tctccccgct 
ggcagaatcc 
aaagggagca 
cccacagggg 
ctggaggagt 
tccacggccc 
cctcagctct 
ctcagtcatg 
tcctcaagtt 
cggcagcttc 
aggctgtccc 
cagtgtgagg 
tccctctacg 
gtcccgggtg 
cagagcattc 
gctggcaaag 
ctggcaaacc 
ggcttgaatt 
tctgccatca 
cccaagatgg 
atcggccatg 
atccaaaaga 
ggcacagagg 
aaccacagcc 
aactctcatg 
gacctgcgag 
aagacggcca 
acccatgatg 
atgtgcactc 



agcaggacag 
gaagtgagaa 
acctgaagcc 
tcttggtcag 
aggcacaagg 
cccagctacc 
ctgccacaag 
cccttgaggg 
gcaccggggc 
agagagtatc 
aagaactgaa 
gaaaaactgt 
acccaacatt 



ctacttacct 
ggctgaccag 
agccactggc 
tgcaggccaa 
cccaccgttg 
tggacgtagc 
cagccagtgc 
ctccccagaa 
tacccaagcc 
ctttcggccc 
ctgaggctca 
cccctcccac 
acagcaaccc 
tggagcctgg 
tctctgggga 
ccgacctgtt 
ccatatgtgt 
gagaaaatgg 
acaatgagaa 
cttccagggg 
cccagctcca 
tggaagactt 
tccaggaaaa 
agaaacagca 
aaaatcctaa 
agttgagctg 
aagtgaacaa 
ccagtctcag 
agcagccacc 
ctccacccgt 
tcagtaggac 
aggtcgatgc 
tggatgacta 
tactgtctct 
caccaggtac 
tagaagagtc 
gttctgtact 
cccaagtgtc 
accagaggaa 
tcttgaagaa 
cagccttgtg 



caagcagatc 
tgaagctgaa 
tgtggtgctg 
ctcacctgtg 
aacttcagac 
gaatgcctac 
cgattactca 
ggcctcaccc 
tttttaccct 
ccagctggag 
gtccgcctcg 
catatctgcg 
cgaaagatac 
gggccagagc 
caaattcagc 
cttcaggttc 
ggtggatacc 
tgccagcagc 
cagcgagtac 
atctcagggg 
cccccctaag 
ggctagcctt 
ctctgggcct 
acaagagctg 
ctccacatcc 
tgctcagccc 
atacctgggc 
gctgtcctca 
ggaagagcat 
caatgacagg 
ttccacctct 
cagagccctc 
acactcccag 
ggagctggag 
gcaggagaag 
caactccagg 
gctctccctg 
gaagggcttg 
cctggtggca 
ccgtctgcgg 
ttcctgccct 
cttccaaggt 
ggtcctcccc 
aaacaatggt 
tgctgatggc 
cgacgcccta 
cacgaggcca 
caaaagtgtc 
agcttccctc 
ggtaggcgaa 
ccaacagcaa 
gaaaagcatg 
agcacggact 



caggtggacc 
cgtgaggaga 
gtggaaggct 
aagaaccctt 
aacagctctc 
aaggtccttc 
tcgagtctgg 
cacagtgtca 
ccagggctcc 
gcccagctcc 
tggcctggaa 
tccgaaaata 
gtcaaatctt 
ttccgggaac 
acaaaggatc 
agcagggaat 
cggtttttct 
acatctttcc 
acacactatg 
cttaagggcg 
gaggccagcc 
tcccaggcac 
ccccttcttg 
cagatgctgc 
acgttgctta 
cactccccag 
agcggccagt 
ggctcctcga 
ttaggtgaga 
ctacgggagc 
cacttctaca 
agggaagaaa 
gaagtggacc 
aaggagctgg 
caggatgaga 
ctgcagcaca 
tccctgcagt 
aaagccttca 
gagattcgag 
ctgcagctgg 
gttaaccaga 
tcagctgctt 
agcaattcgt 
tcggatgagt 
ccatttgcca 
cagcagcaga 
gcacgcagcg 
catgagcttc 
ctcaccatgt 
gagggaaacc 
cagctccttc 
gagcagttca 
aatttagaga 



ttcaggacct 
ccaccagccc 
tgtgctctga 
ggagaacaag 
tcctgaggaa 
agaacctgag 
agagaccccg 
ctgatgaaga 
aggccaaaaa 
ccaaaactgg 
aatacgattc 
cnaaaaggga 
ttgaagacct 
aacttagttc 
ataagagtga 
tacaggagaa 
caccccc' :ag 
tgtcggatga 
aagagaagaa 
agcccagaag 
aggcccagcc 
caatgcactt 
gttgctgtga 
agaagcagct 
gcaaccacac 
caaggggcac 
gggacatgat 
tgtaccagct 
tccggaacct 
agctgcagca 
gtcagggcct 
accaaagcct 
acctgaggga 
agcagcagaa 
tcgtgcattt 
agctggccct 
cagagctcca 
gcctagattc 
ctctgagagt 
aacagcagat 
gcttctcagc 
cccctccagt 
gctctgttcc 
ctgcagcaac 
gtggacacgg 
ttggggaagg 
tccctgcact 
ggagcagcgc 
tctggagagc 
tgatggagaa 
agagcactgc 
tcgtgagcca 
tgaaatcctt 



gggctatgag 

tgagtgtgag 

gcaagggtac 

ccaggaagcc 

ggacatccga 

gagccgggtc 

caagctgata 

cgaaggcttg 

gaatctagag 

actagaaggg 

tttgattcag 

gaaggatttg 

cctgaggaac 

aaggcgttca 

aaaagaagaa 

agagaaagtg 

cagccatgct 

catagaagcc 

gccctcaccc 

cagctccatc 

aggctttcac 

cactgtaccc 

gacaccagtg 

gggacgaagt 

agaagctagc 

catagagctg 

gaggcctcag 

taactccaaa 



gcgccagcgc 
taggctcagc 
ggagtccatg 
gcagacacgg 
ggctctgctt 
ggctgagcgg 
ccgagaggag 
cctgcaacaa 
gatctacgag 
ctgttaccaa 
gcagttggag 
ggatcacggt 
caaggcggag 
ccgggacgtt 
tggctcagac 
gaagacccct 
cagacacgtc 
gaagctgctg 
ggacgcgcag 
cagggctctg 
agctttgcca 
agaactccta 
tgtgcgtctg 
tctgaccagg 
cagggccctg 



3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5260 

5340 

5400 

5460 

5520 

5580 

5640 

5700 

5760 

5820 

5880 

5940 

6000 

6060 

6120 

6180 

6240 

6300 

6360 

6420 

6480 

6540 

6600 

6660 

6720 

6780 

6840 

6900 

6960 

6981 



<210> 4 
<211> 9241 
<212> DNA 
<213> human 

<400> 4 
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ggatccttga gggcactggt gcgactttca ggtgaggtct tagcagatga aagcggctgg 60 

ctgtggcccg cgccagtagt gctttctgct ccgcactcgc cgtgagccag gtgtgcaacc 120 

ggatttgggg cgagggtcgc gctggctacc tcgcatgcgc agagccggaa gcccgctgac 180 

cggactacag ctcccagaag agccttgtgg aggccgcaga cgcgaagccg ctggcgccat 240 

cttgaaatct gatcctccat ccccgaggct ttgcgtctgc gcggccggcc gctgctgctc 300 

cgggagccca gtctgctaaa aggggaggac gttgaggacg cggcggctgg cgggagagac 360 

agctggggag agacatggca gggtcggagc gcggcctgcg cctctgtcac tcagcatcct 420 

cttaggcgtt tccacgcccg ccccctgccc gaggggcggg gctgacggct ctggtacccg 480 

gagtcggcgc gcggggcagg ggcgcgcccc tgcagagtgg ggaccccact gggctgtgcc 540 

atgctgaccg gagaccaccg aggcgggaga cagagcgcgg cgaagagcca ttgagtggtc 600 

acccagtagc cgccgccgcc gccgcctcgg gaagcttgcc acccgctagg agggaagatg 660 

aaggagattt gcaggatctg tgcccgagag ctgtgtggaa accagcggcg ctggatcttc 720 

cacacggcgt ccaagctcaa tctccaggtt ctgctttcgc acgtcttggg caaggatgtc 780 

ccccgcgatg gcaaagccga gttcgcttgc agcaagtgtg ctttcatgct tgatcgaatc 840 

tatcgattcg acacagttat tgcccggatt gaagcgcttt ctattgagcg cttgcaaaag 900 

ctgctactgg agaaggatcg cctcaagttc tgcattgcca gtatgtatcg gaagaataac 960 

gatgactctg gcgcggagat caaggcgggg aatgggacgg ttgacatgtc cgtcttaccc 1020 

gatgcgagat actctgcact gctccaggag gacttcgcct attcagggtt tgagtgctgg 1080 

gtggagaatg aggatcagat ccaggagcca cacagctgcc atggttcaga aggccctgga 1140 

aaccgaccca ggagatgccg tggttgtgcc gctttgcggg ttgctgattc tgactatgaa 1200 

gccatttgta aggtacctcg aaaggtggcc agaagtatct cctgcggccc ttctagcagg 1260 

tggtcgacca gcatttgcac tgaagaacca gcgttgtctg aggttgggcc acccgactta 1320 

gcaagcacaa aggtaccccc agatggagaa agcatggagg aagagacgcc tggttcctct 1380 

gtggaatctt tggatgcaag cgtccaggct agccctccac aacagaaaga tgaggagact 1440 

gagagaagtg caaaggaact tggaaagtgt gactgttgtt cagatgatca ggctccgcag 1500 

catgggtgta atcacaagct ggaattagct cttagcatga ttaaaggtct tgattataag 1560 

cccatccaga gcccccgagg gagcaggctt ccgattccag tgaaatccag cctacctgga 1620 

gccaagcctg gccctagcat gacagatgga gttagttccg gtttccttaa caggtctttg 1680 

aaaccccttt acaagacacc tgtgagttat cccttggagc tttcagacct gcaggagctg 1740 

tgggatgatc tctgtgaaga ttatttgccg ctccgggtcc agcccatgac tgaagagttg 1800 

ctgaaacaac aaaagctgaa ttcacatgag accactataa ctcagcagtc tgtatctgat 1860 

tcccacttgg cagaactcca ggaaaaaatc cagcaaacag aggccaccaa caagattctt 1920 

caagagaaac ttaatgaaat gagctatgaa ctaaagtgtg ctcaggagtc gtctcaaaag 1980 

caagatggta caattcagaa cctcaaggaa actctgaaaa gcagggaacg tgagactgag 2040 

gagttgtacc aggtaattga aggtcaaaat gacacaatgg caaagcttcg agaaatgctg 2100 

caccaaagcc agcttggaca acttcacagc tcagagggta cttctccagc tcagcaacag 2160 

gtagctctgc ttgatcttca gagtgcttta ttctgcagcc aacttgaaat acagaagctc 2220 

cagagggtgg tacgacagaa agagcgccaa ctggctgatg ccaaacaatg tgtgcaattt 2280 

gtagaggctg cagcacacga gagtgaacag cagaaagagg cttcttggaa acataaccag 2340 

gaattgcgaa aagccttgca gcagctacaa gaagaattgc agaataagag ccaacagctt 2400 

cgtgcctggg aggctgaaaa atacaatgag attcgaaccc aggaacaaaa catccagcac 2460 

ctaaaccata gtctgagtca caaggagcag ttgcttcagg aatttcggga gctcctacag 2520 

tatcgagata actcagacaa aacccttgaa gcaaatgaaa tgttgcttga gaaacttcgc 2580 

cagcgaatac atgataaagc tgttgctctg gagcgggcta tagatgaaaa attctctgct 2640 

ctagaagaga aagaaaaaga actgcgccag cttcgtcttg ctgtgagaga gcgagatcat 2700 

gacttagaga gactgcgcga tgtcctctcc tccaatgaag ctactatgca aagtatggag 2760 

agtctcctga gggccaaagg cctggaagtg gaacagttat ctactacctg tcaaaacctc 2820 

cagtggctga aagaagaaat ggaaaccaaa tttagccgtt ggcagaagga acaagagagt 2880 

atcattcagc agttacagac gtctcttcat gataggaaca aagaagtgga ggatcttagt 2940 

gcaacactgc tctgcaaact tggaccaggg cagagtgaga tagcagagga gctgtgccag 3000 

cgtctacagc gaaaggaaag gatgctgcag gaccttctaa gtgatcgaaa taaacaagtg 3060 

ctggaacatg aaatggagat tcaaggcctg cttcagtctg tgagcaccag ggagcaggaa 3120 

agccaagctg ctgcagagaa gttggtgcaa gccttaatgg aaagaaattc agaattacag 3180 

gccctgcgcc aatatttagg agggagagac tccctgatgt cccaagcacc catctctaac 3240 

caacaagctg aagttacccc cactggccgt cttggaaaac agactgatca aggttcaatg 3300 

cagatacctt ccagagatga tagcacttca ttgactgcca aagaggatgt cagcataccc 3360 

agatccacat taggagactt ggacacagtt gcagggctgg aaaaagaact gagtaatgcc 3420 

aaagaggaac ttgaactcat ggctaaaaaa gaaagagaaa gtcagatgga actttctgct 3480 

ctacagtcca tgatggctgt gcaggaagaa gagctgcagg tgcaggctgc tgatatggag 3540 

tctctgacca ggaacataca gattaaagaa gatctcataa aggacctgca aatgcaactg 3600 

gttgatcctg aagacatacc agctatggaa cgcctgaccc aggaagtctt acttcttcgg 3660 

gaaaaagttg cttcagtaga atcccagggt caagaaattt caggaaaccg aagacaacag 3720 

ttgctgctga tgctagaagg actagtagat gaacggagtc ggctcaatga ggccttacaa 3780 
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gcagagagac agctctatag cagtctggtg aagttccatg cccatccaga gagctctgag 3840 
agagaccgaa ctctgcaggt ggaactggaa ggggctcagg tgttacgcag tcggctagaa 3900 
gaagttcttg gaagaagctt ggagcgctta aacaggctgg agaccctggc cgccattgga 3960 
ggtgcagctg caggggatga caccgaagat acaagcactg agttcactga cagtattgag 4020 
gaggaggctg cacaccatag tcaccagcaa cttgtcaagg tggctttgga gaaaagtctg 4080 
gcaactgtgg agacccagaa cccatctttt tcccctcctt ctccgatggg aggggacagt 4140 
aacaggtgtc ttcaggaaga aatgctccac ctgagggctg agttccacca gcacttagaa 4200 
gagaagagga aagctgagga ggaactgaag gagctaaagg ctcaaattga ggaagcagga 4260 
ttctcctcag tgtcccacat caggaacacc atgctgagcc tttgccttga gaatgcggag 4320 
ctgaaagagc agatgggaga agcaatgtct gatggatggg agatcgagga agacaaggag 4380 
aagggcgagg tgatggttga gactgtggta accaaagagg gtctgagtga gagtagcctt 4440 
caggctgagt tcagaaagct ccagggaaaa ctgaagaatg cccacaatat catcaacctc 4500 
ctcaaagaac aacttgtgct gagtagcaag gaagggaata gtaaacttac tccagagctc 4560 

cttgtgcatc tgaccagcac cattgaaaga ataaacacag aactggttgg ttcccctggg .4620 
aagcaccaac accaagagga ggggaatgtg actgtgaggc ctttccccag accccagagc 4680 
cttgaccttg gggctacctt cacagtggat gcccaccaat tggataacca gtcccagcct 4740 
cgtgaccctg ggcctcagtc agcgtttagc cvaccagggt ccacccagca cctgcgctcc 4800 
cagctgtcac aatgcaaaca acgctatcaa gatctccagg agaagctgct gctatcagaa 4860 
gccactgtct ttgctcaggc taacgagctg gagaaataca gagttatgct tacaggtgaa 4920 
tccttggtga agcaggacag caagcagatc caggtggacc tccaggacct gggctatgag 4980 

acttgtggcc gaagcgagaa tgaggctgaa cgggaggaaa ccaccagtcc tgagtgtgag 5040 

gagcacaaca gcctcaagga aatggtcctg atggaggggc tgtgctctga gcagggacgc 5100 

cggggctcaa cactggctag ttcctctgag aggaagccct tggagaacca gctagggaag 5160 

caggaagagt tccgggtata tggaaagtca gaaaacatct tggtcctacg aaaggacatc 5220 

aaagatctga aggcccagct gcagaatgcc aacaaggtca ttcaaaacct caagagccgg 5280 

gtccggtccc tctcagttac aagtgattat tcgtctagtc tggaaagacc ccggaagctg 5340 

agagctgttg gcaccttgga ggggtcttca cctcatagtg tccctgatga ggatgagggg 5400 

tggctgtctg atggcactgg ggctttctac tctccagggc ttcaggccaa aaaggacctg 5460 

gagagtctca tccagagagt atcccagctg gaggcccagc tcccaaaaaa tggactagaa 5520 

gagaagctgg ctgaggagct gagatcagcc tcgtggcctg ggaaatatga ttccctgatt 5580 

caggatcagg cccgggaact gtcttaccta cggcaaaaaa tacgagaagg gagaggtatt 5640 

tgttatctta tcacccggca tgcaaaagat acagtaaaat cttttgagga tctcctaagg 5700 

agcaatgaca ttgactacta cctgggacag agcttccggg agcaactcgc ccagggaagc 5760 

cagctgacag agaggctcac cagcaaactc agcaccaagg atcataaaag tgagaaagat 5820 

caagctggac ttgagccact ggccctcagg ctcagcaggg agctgcagga gaaggagaaa 5880 

gtgattgaag tcctgcaggc caagctggat gctcggtccc tcacaccctc cagcagccat 5940 

gccttgtctg actcccaccg ctctcccagc agcacctctt tcctgtctga tgaactggaa 6000 

gcctgctctg acatggacat agtcagcgag tacacacact atgaagagaa gaaagcttct 6060 

cccagtcact cagattccat ccatcattcg agtcattctg ctgtgttgtc ttctaaacca 6120 

tcatcaacca gtgcatctca gggggctaag gccgaatcca acagcaaccc catcagcttg 6180 

ccaactcccc agaatacccc caaggaggcc aaccaggccc attcaggctt tcattttcac 6240 

tccataccca agctggctag ccttcctcag gcaccattgc cctcagctcc atccagcttc 6300 

ctgcctttca gccccactgg ccctctcctc cttggctgct gtgagacacc agtggtctcc 6360 

ttggctgagg ctcagcagga gctacagatg ctgcagaagc agttgggaga aagtgccagc 6420 

actgttcctc ctgcttccac agctacattg ctgagcaacg acttggaagc cgactcttcc 6480 

tactacctca actctgccca gcctcactct cctccaaggg gcaccataga actgggaaga 6540 

atcctagagc ctgggtacct gggcagcagt ggcaagtggg atgtgatgag gcctcagaaa 6600 

gggagtgtat ctggggacct atcctcaggc tcctctgtgt accagcttaa ctccaaaccc 6660 

acaggggctg acctgctgga agagcatctt ggtgaaatcc ggaacctgcg ccagcgcctg 6720 

gaggagtcca tctgcatcaa tgaccgccta cgggagcaac tggaacaccg gctgacctct 6780 

actgctcgtg gaaggggatc cacttctaac ttctacagtc agggcctgga gtccatacct 6840 

cagctctgca atgagaacag agtcctcagg gaagacaatc gaagacttca ggctcaactg 6900 

agtcatgttt ccagagagca ctcccaggaa acagaaagcc tgagggaggc tctgctgtcc 6960 

tctcgatccc accttcaaga gctggaaaag gagctggagc accagaaggt ggaaaggcag 7020 

cagcttttgg aagacttgag ggagaagcag caagaggtct tgcatttcag ggaggaacgt 7080 

ctttccctcc aggaaaacga ctccagtggg ccttgcctct ccctggtcag actgcagcac 7140 

aagctggttc tcctgcagca acagtgtgaa gagaaacagc agctctttga gtccctccag 7200 

tcagagctac aaatctacga ggcactttat ggcaattcca agaaggggct gaaagcttac 7260 

agcctggatg cctgtcacca aatccctttg agcagtgacc tgagccacct ggtggcagag 7320 

gtacgagctc tgagagggca gctggagcag agcattcagg ggaacaattg tctgcgactg 7380 

cagctgcaac agcagctgga gagcggtgct ggcaaagcca gcctcagccc ctcctccatt 7440 

aaccagaact tcccagccag cactgaccct ggaaacaagc agctgctcct ccaagattca 7500 

gctgtgtccc ctccagtccg ggatgttggt atgaattccc cagctctggt cttccccagc 7560 
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tctgcttcct ctactcctgg ctcagaaacg cccataatca acagagcaaa tggcttgggt 7620 

ttggatactt ctccagtaat gaagacccct cccaagctag agggtgatgc tactgatggc 7680 

tcctttgcca ataagcatgg ccgccatgtc attggccaca ttgatgacta cagtgcccta 7740 

agacagcaga^ttgcggaggg caagctgctg gtcaaaaaga tagtgtctct tgtgagatca 7800 

gcgtgcagct tccctggcct tgaagcccaa ggcacagagg tgctaggcag caaaggtatt 7860 

catgagcttc ggagcagcac cagtgccctg caccatgccc tagaggagtc ggcttccctc 7920 

ctcaccatgt tctggagagc agccctgcca agcacccaca tccctgtgct gcctggcaaa 7980 

gtgggagaat caacagaaag ggaacttctg gaactgagaa ccaaagtatc caaacaggag 8040 

cggctccttc agagcacaac tgagcatctg aagaacgcca accagcagaa ggagagcatg 8100 

gagcagttca tcgtcagcca gctaaccaga acacatgatg ttttaaagaa ggcaaggact 8160 

aacttagagg tgaaatccct aagggctctg ccatgtactc cagccttgtg acccttgcct 8220 

tccaggaacc atgcaagaag cgcagccacc agaagtcctt aaaacagcag gaaaggtggg 8280 

cctgtccccc ttttgtgcag ctacctatct gctgaggagc atctgggcct cattcctcca 8340 

agtccacggg agggtccaga agagggagtc agagatgtat cctggtggag ctgggagaaa 8400 

ggcagaaagc ctttctgaca gctatggaat acgattagcc aaggtccact tggcccagca 8460 

ctaagaaaaa gatgcgtagt ttgcacagaa ggttttgtga tcctgcctct caacagcccc 8520 

agcagcttgg gaactagcaa gagcacattt cttgcctcat cagctgtcct gagatgc^.iaa 8580 

actcagtgga tataggaccc tgattccg'at gaaaggggca cgtggtccca atgctggagc 8640 

tcctctggca ggttctaaaa gcacactact gagcagcggt gccctgccgg acactgctgg 8700 

cgg^ggctca gtgagcacta ctcacagatc cacacctgac cctgttgggt cgagtcaggc 8760 

tgggccttgg tctgcactgt agcacctgtg ttctttgagt tcacatcatg aatgtggtga 8820 

cttcccagat accatctcag gcttaaccta gcacatccta tttcttttct tctatgatat 8880 

ccaaattgga ctgacctcac ttcaaagttg ctgtcccatt ttgtcaccct atcttatctc 8940 

ggggaaattg cagactgatg gccagaccaa ctctgttgaa attcttgcat agagcaaacc 9000 

tgtgctcatt tttaagtggc atgggagagg cccccagcct agtaaagcct agtctgtgtc 9060 

ttcacagtgc tggtagaatg tgtttgtgtg tataaatata tgatatagat ttatatatgt 9120 

tgctaacgcc atatattgaa ggccaacata actggtggac agggtgggtg acagaaaatg 9180 

aaagcctttt tggtgattgt taaagcaaga tgtgtataaa gaaataaata gtttttcttt 9240 

c 9241 



<210> 5 

<211> 2517 

<212> PRT 

<213> human 

<400> 5 

Met Lys Glu lie Cys Arg lie Cys Ala Arg Glu Leu Cys Gly Asn Gin 

1 5 10 15 

Arg Arg Trp lie Phe His Thr Ala Ser Lys Leu Asn Leu Gin Val Leu 

20 25 30 

Leu Ser His Val Leu Gly Lys Asp Val Pro Arg Asp Gly Lys Ala Glu 

35 40 45 

Phe Ala Cys Ser Lys Cys Ala Phe Met Leu Asp Arg He Tyr Arg Phe 

50 55 60 

Asp Thr Val He Ala Arg He Glu Ala Leu Ser He Glu Arg Leu Gin 
65 70 75 80 

Lys Leu Leu Leu Glu Lys Asp Arg Leu Lys Phe Cys He Ala Ser Met 

85 90 95 

Tyr Arg Lys Asn Asn Asp Asp Ser Gly Ala Glu He Lys Ala Gly Asn 

100 105 110 

Gly Thr Val Asp Met Ser Val Leu Pro Asp Ala Arg Tyr Ser Ala Leu 

115 120 125 

Leu Gin Glu Asp Phe Ala Tyr Ser Gly Phe Glu Cys Trp Val Glu Asn 

130 135 140 

Glu Asp Gin He Gin Glu Pro His Ser Cys His Gly Ser Glu Gly Pro 
145 150 ' 155 160 

Gly Asn Arg Pro Arg Arg Cys Arg Gly Cys Ala Ala Leu Arg Val Ala 

165 170 175 

Asp Ser Asp Tyr Glu Ala He Cys Lys Val Pro Arg Lys Val Ala Arg 

180 185 190 

Ser He Ser Cys Gly Pro Ser Ser Arg Trp Ser Thr Ser He Cys Thr 
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1^5 200 205 

Glu Glu Pro Ala Leu Ser Glu Val Gly Pro Pro Asp Leu Ala Ser Thr 

210 215 220 

Lys Val Pro Pro Asp Gly Glu Ser Met Glu Glu Glu Thr Pro Glv Ser 
225 230 235 ^ 240 

Ser Val Glu Ser Leu Asp Ala Ser Val Gin Ala Ser Pro Pro Gin Gin 

245 250 255 

Lys Asp Glu Glu Thr Glu Arg Ser Ala Lys Glu Leu Gly Lys Cys Asp 

260 265 270 

Cys Cys Ser Asp Asp Gin Ala Pro Gin His Gly Cys Asn His Lys Leu 

275 280 285 

Glu Leu Ala Leu Ser Met lie Lys Gly Leu Asp Tyr Lys Pro lie Gin 

290 295 300 

Ser Pro Arg Gly Ser Arg Leu Pro He Pro Val Lys Ser Ser Leu Pro 
305 310 315 320 

Gly Ala Lys Pro Gly Pro Ser Met Thr Asp Gly Val Ser Ser Gly Phe 

325 330 335 

Leu Asn Arg Ser Leu Lys Pro Leu Tyr Lys Thr Pro Val Ser Tyr Pro 

340 345 350 

Leu Glu Leu Ser Asp Leu Gin Glu Leu Trp Asp Asp Leu Cys Glu Asv 

355 360 365 

Tyr Leu Pro Leu Arg Val Gin Pro Met Thr Glu Glu Leu Leu Lys Gin 

370 375 380 

Gin Lys Leu Asn Ser His Glu Thr Thr lie Thr Gin Gin Ser Val Ser 
390 395 4OO 

Asp Ser His Leu Ala Glu Leu Gin Glu Lys He Gin Gin Thr Glu Ala 

405 410 415 

Thr Asn Lys He Leu Gin Glu Lys Leu Asn Glu Met Ser Tyr Glu Leu 

420 425 430 

Lys Cys Ala Gin Glu Ser Ser Gin Lys Gin Asp Gly Thr He Gin Asn 

435 440 445 

Leu Lys Glu Thr Leu Lys Ser Arg Glu Arg Glu Thr Glu Glu Leu Tvr 

450 455 460 

Gin Val He Glu Gly Gin Asn Asp Thr Met Ala Lys Leu Arg Glu Met 
470 475 480 

Leu His Gin Ser Gin Leu Gly Gin Leu His Ser Ser Glu Gly Thr Ser 

485 490 495 

Pro Ala Gin Gin Gin Val Ala Leu Leu Asp Leu Gin Ser Ala Leu Phe 

500 505 510 

Cys Ser Gin Leu Glu He Gin Lys Leu Gin Arg Val Val Arg Gin Lys 

515 520 525 

Glu Arg Gin Leu Ala Asp Ala Lys Gin Cys Val Gin Phe Val Glu Ala 

530 535 540 

Ala Ala His Glu Ser Glu Gin Gin Lys Glu Ala Ser Trp Lys His Asn 
545 550 555 560 

Gin Glu Leu Arg Lys Ala Leu Gin Gin Leu Gin Glu Glu Leu Gin Asn 

565 570 575 

Lys Ser Gin Gin Leu Arg Ala Trp Glu Ala Glu Lys Tyr Asn Glu He 

580 585 590 

Arg Thr Gin Glu Gin Asn He Gin His Leu Asn His Ser Leu Ser His 

595 600 605 

Lys Glu Gin Leu Leu Gin Glu Phe Arg Glu Leu Leu Gin Tyr Arg Asp 

610 615 620 ^ • 

Asn Ser Asp Lys Thr Leu Glu Ala Asn Glu Met Leu Leu Glu Lys Leu 
625 630 635 640 

Arg Gin Arg He His Asp Lys Ala Val Ala Leu Glu Arg Ala He Asp 

645 . 650 655 

Glu Lys Phe Ser Ala Leu Glu Glu Lys Glu Lys Glu Leu Arg Gin Leu 

660 665 670 

Arg Leu Ala Val Arg Glu Arg Asp His Asp Leu Glu Arg Leu Arg Asp 

675 680 685 

Val Leu Ser Ser Asn Glu Ala Thr Met Gin Ser Met Glu Ser Leu Leu 
690 695 700 
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Arg Ala Lys Gly Leu Glu Val Glu Gin Leu Ser Thr Thr Cys Gin Asn 
705 710 715 720 

Leu Gin Trp Leu Lys Glu Glu Met Glu Thr Lys Phe Ser Arg Trp Gin 

725 730 735 

Lys Glu Gin Glu Ser He He Gin Gin Leu Gin Thr Ser Leu His Asp 

740 745 750 

Arg Asn Lys Glu Val Glu Asp Leu Ser Ala Thr Leu Leu Cys Lys Leu 

755 760 765 

Gly Pro Gly Gin Ser Glu He Ala Glu Glu Leu Cys Gin Arg Leu Gin 

770 775 780 

Arg Lys Glu Arg Met Leu Gin Asp Leu Leu Ser Asp Arg Asn Lys Gin 
785 790 795 800 

Val Leu Glu His Glu Met Glu He Gin Gly Leu Leu Gin Ser Val Ser 

805 810 815 

Thr Arg Glu Gin Glu Ser Gin Ala Ala Ala Glu Lys Leu Val Gin Ala 

820 825 830 

Leu Met Glu Arg Asn Ser Glu Leu Gin Ala Leu Aij Gin Tyr Leu Gly 

835 840 845 

Gly Arg Asp Ser Leu Met Ser Gin Ala Pro He Ser Asn Gin Gin Ala 

850 855 860 

Glu Val Thr Pro Thr Gly Arg Leu Gly Lys Gin Thr Asp Gin Gly Ser 
865 870 875 880 

Met Gin He Pro Ser Arg Asp Asp Ser Thr Ser Leu Thr Ala Lys Glu 

885 890 895 

Asp Val Ser He Pro Arg Ser Thr Leu Gly Asp Leu Asp Thr Val Ala 

900 905 910 

Gly Leu Glu Lys Glu Leu Ser Asn Ala Lys Glu Glu Leu Glu Leu Met 

915 920 925 

Ala Lys Lys Glu Arg Glu Ser Gin Met Glu Leu Ser Ala Leu Gin Ser 

930 935 940 

Met Met Ala Val Gin Glu Glu Glu Leu Gin Val Gin Ala Ala Asp Met 
945 950 955 960 

Glu Ser Leu Thr Arg Asn He Gin He Lys Glu Asp Leu He Lys Asp 

965 970 975 

Leu Gin Met Gin Leu Val Asp Pro Glu Asp He Pro Ala Met Glu Arg 

980 985 990 

Leu Thr Gin Glu Val Leu Leu Leu Arg Glu Lys Val Ala Ser Val Glu 

995 1000 1005 

Ser Gin Gly Gin Glu He Ser Gly Asn Arg Arg Gin Gin Leu Leu Leu 

1010 1015 1020 

Met Leu Glu Gly Leu Val Asp Glu Arg Ser Arg Leu Asn Glu Ala Leu 
1025 1030 1035 1040 

Gin Ala Glu Arg Gin Leu Tyr Ser Ser Leu Val Lys Phe His Ala His 

1045 1050 1055 

Pro Glu Ser Ser Glu Arg Asp Arg Thr Leu Gin Val Glu Leu Glu Gly 

1060 1065 1070 

Ala Gin Val Leu Arg Ser Arg Leu Glu Glu Val Leu Gly Arg Ser Leu 

1075 1080 1085 

Glu Arg Leu Asn Arg Leu Glu Thr Leu Ala Ala He Gly Gly Ala Ala 

1090 1095 1100 

Ala Gly Asp Asp Thr Glu Asp Thr Ser Thr Glu Phe Thr Asp Ser He 
1105 1110 1115 1120 

Glu Glu Glu Ala Ala His His Ser His Gin Gin Leu Val Lys Val Ala 

1125 1130 1135 

Leu Glu Lys Ser Leu Ala Thr Val Glu Thr Gin Asn Pro Ser Phe Ser 

1140 1145 1150 

Pro Pro Ser Pro Met Gly Gly Asp Ser Asn Arg Cys Leu Gin Glu Glu 

1155 1160 1165 

Met Leu His Leu Arg Ala Glu Phe His Gin His Leu Glu Glu Lys Arg 

1170 1175 1180 

Lys Ala Glu Glu Glu Leu Lys Glu Leu Lys Ala Gin He Glu Glu Ala 
1185 1190 1195 1200 

Gly Phe Ser Ser Val Ser His He Arg Asn Thr Met Leu Ser Leu Cys 
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1205 1210 1215 

Leu Glu Asn Ala Glu Leu Lys Glu Gin Met Gly Glu Ala Met Ser Asp 

1220 1225 1230 

Gly Trp Glu He Glu Glu Asp Lys Glu Lys Gly Glu Val Met Val Glu 

1235 1240 1245 

Thr Val Val Thr Lys Glu Gly Leu Ser Glu Ser Ser Leu Gin Ala Glu 

1250 1255 1260 

Phe Arg Lys Leu Gin Gly Lys Leu Lys Asn Ala His Asn He He Asn 
1265 1270 1275 1280 

Leu Leu Lys Glu Gin Leu Val Leu Ser Ser Lys Glu Gly Asn Ser Lys 

1285 1290 1295 

Leu Thr Pro Glu Leu Leu Val His Leu Thr Ser Thr He Glu Arg He 

1300 1305 1310 

Asn Thr Glu Leu Val Gly Ser Pro Gly Lys His Gin His Gin Glu Glu 

1315 1320 1325 

Gly Asn Val Thr Val Arg Pro Phe Pro Arg Pro Gin Ser Leu Asp Leu 

1^30 1335 1340 

Gly Ala Thr Phe Thr Val Asp Ala His Gin Leu Asp Asn Gin Ser Gin 
1345 1350 1355 1360 

Pro Arg Asp Pro Gly Pro Gin Ser Ala Phe Ser Leu Pro Gly Ser Thr 

1365 1370 1375 

Gin His Leu Arg Ser Gin Leu Ser Gin Cys Lys Gin Arg Tyr Gin Asp 

1380 1385 1390 

Leu Gin Glu Lys Leu Leu Leu Ser Glu Ala Thr Val Phe Ala Gin Ala 

1395 1400 1405 

Asn Glu Leu Glu Lys Tyr Arg Val Met Leu Thr Gly Glu Ser Leu Val 

1410 1415 1420 

Lys Gin Asp Ser Lys Gin He Gin Val Asp Leu Gin Asp Leu Gly Tyr 
1425 1430 1435 1440 

Glu Thr Cys Gly Arg Ser Glu Asn Glu Ala Glu Arg Glu Glu Thr Thr 

1445 1450 1455 

Ser Pro Glu Cys Glu Glu His Asn Ser Leu Lys Glu Met Val Leu Met 

1460 1465 1470 

Glu Gly Leu Cys Ser Glu Gin Gly Arg Arg Gly Ser Thr Leu Ala Ser 

1475 1480 1485 

Ser Ser Glu Arg Lys Pro Leu Glu Asn Gin Leu Gly Lys Gin Glu Glu 

1490 1495 1500 

Phe Arg Val Tyr Gly Lys Ser Glu Asn He Leu Val Leu Arg Lys Asp 
1505 1510 1515 1520 

He Lys Asp Leu Lys Ala Gin Leu Gin Asn Ala Asn Lys Val He Gin 

1525 1530 1535 

Asn Leu Lys Ser Arg Val Arg Ser Leu Ser Val Thr Ser Asp Tyr Ser 

1540 1545 1550 

Ser Ser Leu Glu Arg Pro Arg Lys Leu Arg Ala Val Gly Thr Leu Glu 

1555 1560 1565 

Gly Ser Ser Pro His Ser Val Pro Asp Glu Asp Glu Gly Trp Leu Ser 

1570 1575 1580 

Asp Gly Thr Gly Ala Phe Tyr Ser Pro Gly Leu Gin Ala Lys Lys Asp 
1585 1590 1595 1600 

Leu Glu Ser Leu He Gin Arg Val Ser Gin Leu Glu Ala Gin Leu Pro 

1605 1610 1615 

Lys Asn Gly Leu Glu Glu Lys Leu Ala Glu Glu Leu Arg Ser Ala Ser 

1620 1625 1630 

Trp Pro Gly Lys Tyr Asp Ser Leu He Gin Asp Gin Ala Arg Glu Leu 

1635 1640 1645 

Ser Tyr Leu Arg Gin Lys He Arg Glu Gly Arg Gly He Cys Tyr Leu 

1650 1655 _ 1660 

He Thr Arg His Ala Lys Asp Thr Val Lys Ser Phe Glu Asp Leu Leu 
1665 1670 1675 1680 

Arg Ser Asn Asp He Asp Tyr Tyr Leu Gly Gin Ser Phe Arg Glu Gin 

1685 1690 1695 

Leu Ala Gin Gly Ser Gin Leu Thr Glu Arg Leu Thr Ser Lys Leu Ser 
1700 1705 1710 
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Thr Lys Asp His Lys Ser Glu Lys Asp Gin Ala Gly Leu Glu Pro Leu 

1715 1720 1725 

Ala Leu Arg Leu Ser Arg Glu Leu Gin Glu Lys Glu Lys Val He Glu 

1730 1735 1740 

Val Leu Gin Ala Lys Leu Asp Ala Arg Ser Leu Thr Pro Ser Ser Ser 
1745 1750 1755 1760 

His Ala Leu Ser Asp Ser His Arg Ser Pro Ser Ser Thr Ser Phe Leu 

1765 1770 1775 

Ser Asp Glu Leu Glu Ala Cys Ser Asp Met Asp He Val Ser Glu Tyr 

1780 1785 1790 

Thr His Tyr Glu Glu Lys Lys Ala Ser Pro Ser His Ser Asp Ser He 

1795 1800- 1805 

His His Ser Ser His Ser Ala Val Leu Ser Ser Lys Pro Ser Ser Thr 

1810 1815 1820 

Ser Ala Ser Gin Gly Ala Lys Ala Glu Ser Asn Ser Asn Pro He Ser 
1825 1830 1835 1840 

Leu Pro Thr Pro Gin Asn Thr Pro -^.ys Glu Ala Asn Gin Ala His Ser 

1845 1850 1855 

Gly Phe His Phe His Ser He Pro Lys Leu Ala Ser Leu Pro Gin Ala 

1860 1865 1870 

Pro Leu Pro Ser Ala Pro Ser Ser Phe Leu Pro Phe Ser Pro Thr Gly 

1875 1880 1885 

Pro Leu Leu Leu Gly Cys Cys Glu Thr Pro Val Val Ser Leu Ala Glu 

1890 1895 1900 

Ala Gin Gin Glu Leu Gin Met Leu Gin Lys Gin Leu Gly Glu Ser Ala 

1910 1915 1920 

Ser Thr Val Pro Pro Ala Ser Thr Ala Thr Leu Leu Ser Asn Asp Leu 

1925 1930 1935 

Glu Ala Asp Ser Ser Tyr Tyr Leu Asn Ser Ala Gin Pro His Ser Pro 

1940 1945 1950 

Pro Arg Gly Thr He Glu Leu Gly Arg He Leu Glu Pro Gly Tyr Leu 

1955 1960 1965 

Gly Ser Ser Gly Lys Trp Asp Val Met Arg Pro Gin Lys Gly Ser Val 

1970 1975 1980 

Ser Gly Asp Leu Ser Ser Gly Ser Ser Val Tyr Gin Leu Asn Ser Lys 
1985 1990 1995 2000 

Pro Thr Gly Ala Asp Leu Leu Glu Glu His Leu Gly Glu He Arg Asn 

2005 2010 2015 

Leu Arg Gin Arg Leu Glu Glu Ser He Cys He Asn Asp Arg Leu Arg 

2020 2025 2030 

Glu Gin Leu Glu His Arg Leu Thr Ser Thr Ala Arg Gly Arg Gly Ser 

2035 2040 2045 

Thr Ser Asn Phe Tyr Ser Gin Gly Leu Glu Ser He Pro Gin Leu Cys 

2050 2055 2060 

Asn Glu Asn Arg Val Leu Arg Glu Asp Asn Arg Arg Leu Gin Ala Gin 
2065 2070 2075 2080 

Leu Ser His Val Ser Arg Glu His Ser Gin Glu Thr Glu Ser Leu Arg 

2085 2090 2095 

Glu Ala Leu Leu Ser Ser Arg Ser His Leu Gin Glu Leu Glu Lys Glu 

2100 2105 2110 

Leu Glu His Gin Lys Val Glu Arg Gin Gin Leu Leu Glu Asp Leu Arg 

2115 2120 2125 

Glu Lys Gin Gin Glu Val Leu His Phe Arg Glu Glu Arg Leu Ser Leu 

2130 2135 2140 

Gin Glu Asn Asp Ser Ser Gly Pro Cys Leu Ser Leu Val Arg Leu Gin 
2145 2150 2155 2160 

His Lys Leu Val Leu Leu Gin Gin Gin Cys Glu Glu Lys Gin Gin Leu 

2165 2170 2175 

Phe Glu Ser Leu Gin Ser Glu Leu Gin He Tyr Glu Ala Leu Tyr Gly 

2180 2185 2190 

Asn Ser Lys Lys Gly Leu Lys Ala Tyr Ser Leu Asp Ala Cys His Gin 

2195 2200 2205 

He Pro Leu Ser Ser Asp Leu Ser His Leu Val Ala Glu Val Arg Ala 
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2210 2215 2220 

Leu Arg Gly Gin Leu Glu Gin Ser He Gin Gly Asn Asn Cys Leu Arg 
2225 2230 2235 2240 

Leu Gin Leu Gin Gin Gin Leu Glu Ser Gly Ala Gly Lys Ala Ser Leu 

2245 2250 2255 

Ser Pro Ser Ser He Asn Gin Asn Phe Pro Ala Ser Thr Asp Pro Gly 

2260 2265 2270 

Asn Lys Gin Leu Leu Leu Gin Asp Ser Ala Val Ser Pro Pro Val Arg 

2275 2280 2285 

Asp Val Gly Met Asn Ser Pro Ala Leu Val Phe Pro Ser Ser Ala Ser 

2290 2295 2300 

Ser Thr Pro Gly Ser Glu Thr Pro He He Asn Arg Ala Asn Gly Leu 
2305 2310 2315 2320 

Gly Leu Asp Thr Ser Pro Val Met Lys Thr Pro Pro Lys Leu Glu Gly 

2325 2330 2335 

Asp Ala Thr Asp Gly Ser Phe Ala Asn Lys His Gly Arg His Val He 

2340 2345 2350 

Gly His He Asp Asp Tyr Ser Ala Leu Arg Gin Gin He Ala Glu Gly 

2355 2360 2365 

Lys Leu Leu Val Lys Lys He Val Ser Leu Val Afg Ser Ala Cys Ser 

2370 2375 2380 

Phe Pro Gly Leu Glu Ala Gin Gly Thr Glu Val Leu Gly Ser Lys Gly 
2385 2390 2395 2400 

He His Glu Leu Arg Ser Ser Thr Ser Ala Leu His His Ala Leu Glu 

2405 2410 2415 

Glu Ser Ala Ser Leu Leu Thr Met Phe Trp Arg Ala Ala Leu Pro Ser 

2420 2425 2430 

Thr His He Pro Val Leu Pro Gly Lys Val Gly Glu Ser Thr Glu Arg 

2435 2440 2445 

Glu Leu Leu Glu Leu Arg Thr Lys Val Ser Lys Gin Glu Arg Leu Leu 

2450 2455 2460 

Gin Ser Thr Thr Glu His Leu Lys Asn Ala Asn Gin Gin Lys Glu Ser 
2465 2470 2475 2480 

Met Glu Gin Phe He Val Ser Gin Leu Thr Arg Thr His Asp Val Leu 

2485 2490 2495 

Lys Lys Ala Arg Thr Asn Leu Glu Val Lys Ser Leu Arg Ala Leu Pro 

2500 2505 2510 

Cys Thr Pro Ala Leu 
2515 

<210> 6 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primers 
<400> 6 

cggaattcga ggaggcctac cagaaac 27 

<210> 7 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primers 
<400> 7 

tgagtcgact acgtgtcaag gcaacaatgg tc 32 
<210> 8 
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<211> 1683 
<212> PRT 
<213> rat 

<400> 8 

Met Met Ala Gin Phe Pro Thr Ala Met Asn Gly Gly Pro Asn Met Trp 

^ 5 10 15 

Ala He Thr Ser Glu Glu Arg Thr Lys His Asp Lys Gin Phe Asp Asn 

20 25 30 

Leu Lys Pro Ser Gly Gly Tyr He Thr Gly Asp Gin Ala Arg Thr Phe 

35 40 45 

Phe Leu Gin Ser Gly Leu Pro Ala Pro Val Leu Ala Glu He Trp Ala 

50 55 60 

Leu Ser Asp Leu Asn Lys Asp Gly Lys Met Asp Gin Gin Glu Phe Ser 

70 75 80 

He Ala Met Lys Leu He Lys Leu Lys Leu Gin Gly Gin Gin Leu Pro 

90 95 
Val Val Leu Pro Pro He Met Lys Gin Pro Pro Met Phe Ser Pro Leu 

100 105 110 

He Ser Ala Arg Phe Gly Met Gly Ser Met Pro Asn Leu Ser He His 

115 120 125 

Gin Pro Leu Pro Pro Val Ala Pro He Thr Ala Pro Leu Ser Ser Ala 

130 135 140 

Thr Ser Gly Thr Ser He Pro Pro Leu Met Met Pro Ala Pro Leu Val 
^45 150 155 160 

Pro Ser Val Ser Thr Ser Ser Leu Pro Asn Gly Thr Ala Ser Leu He 

165 170 175 

Gin Pro Leu Ser He Pro Tyr Ser Ser Ser Thr Leu Pro His Ala Ser 

180 185 190 

Ser Tyr Ser Leu Met Met Gly Gly Phe Gly Gly Ala Ser He Gin Lys 

195 200 205 

Ala Gin Ser Leu He Asp Leu Gly Ser Ser Ser Ser Thr Ser Ser Thr 

210 215 220 

Ala Ser Leu Ser Gly Asn Ser Pro Lys Thr Gly Thr Ser Glu Trp Ala 
225 230 235 240 

Val Pro Gin Pro Ser Arg Leu Lys Tyr Arg Gin Lys Phe Asn Ser Leu 

245 250 255 

Asp Lys Ser Met Ser Gly Tyr Leu Ser Gly Phe Gin Ala Arg Asn Ala 

260 265 270 

Leu Leu Gin Ser Asn Leu Ser Gin Thr Gin Leu Ala Thr He Trp Thr 

275 280 285 

Leu Ala Asp He Asp Gly Asp Gly Gin Leu Lys Ala Glu Glu Phe He 

290 295 300 

Leu Ala Met His Leu Thr Asp Met Ala Lys Ala Gly Gin Pro Leu Pro 
305 310 315 320 

Leu Thr Leu Pro Pro Glu Leu Val Pro Pro Ser Phe Arg Gly Glv Lvs 

325 330 335 

Gin He Asp Ser He Asn Gly Thr Leu Pro Ser Tyr Gin Lys Thr Gin 

340 345 3S0 

Glu Glu Glu Pro Gin Lys Lys Leu Pro Val Thr Phe Glu Asp Lys Ara 

355 360 365 

Lys Ala Asn Tyr Glu Arg Gly Asn Met Glu Leu Glu Lys Arg Arg Gin 

370 375 380 

Val Leu Met Glu Gin Gin Gin Arg Glu Ala Glu Arg Lys Ala Gin Lvs 
385 390 395 400 

Glu Lys Glu Glu Trp Glu Arg Lys Gin Arg Glu Leu Gin Glu Gin Glu 

405 . 410 415 

Trp Lys Lys Gin Leu Glu Leu Glu Lys Arg Leu Glu Lys Gin Arg Glu 

420 425 430 

Leu Glu Arg Gin Arg Glu Glu Glu Arg Arg Lys Glu He Glu Arg Arg 

435 440 445 

Glu Ala Ala Lys Gin Glu Leu Glu Arg Gin Arg Arg Leu Glu Trp Glu 
450 455 460 
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Arg He Arg Arg Gin Glu Leu Leu Asn Gin Lys Asn Arg Glu Gin Glu 
465 470 475 480 

Glu He Val Arg Leu Asn Ser Lys Lys Lys Ser Leu His Leu Glu Leu 

485 490 495 

Glu Ala Val Asn Gly Lys His Gin Gin He Ser Gly Arg Leu Gin Asp 

500 505 510 

Val Arg He Arg Lys Gin Thr Gin Lys Thr Glu Leu Glu Val Leu Asp 

515 520 525 

Lys Gin Cys Asp Leu Glu He Met Glu He Lys Gin Leu Gin Gin Glu 

530 535 540 

Leu Gin Glu Tyr Gin Asn Lys Leu He Tyr Leu Val Pro Glu Lys Gin 
^45 550 555 560 

Leu Leu Asn Glu Arg He Lys Asn Met Gin Leu Ser Asn Thr Pro Asp 

565 570 575 

Ser Gly He Ser Leu Leu His Lys Lys Ser Ser Glu Lys Glu Glu Leu 

580 585 590 

Cys Gin Arg Leu Lys Glu Gin Leu Asp Ala Le\i Glu Lys Glu Thr Ala 

595 600 605 

Ser Lys Leu Ser Glu Met Asp Ser Phe Asn Asn Gin Leu Lys Cvs Glv 

610 615 620 

Asn Met Asp Asp Ser Val Leu Gin Cys Leu Leu Ser Leu Leu Ser Cvs 
625 630 635 640 

Leu Asn Asn Leu Phe Leu Leu Leu Lys Glu Leu Arg Glu Ser Tyr Asn 

645 650 655 

Thr Gin Gin Leu Ala Leu Glu Gin Leu His Lys He Lys Arg Asp Lys 

660 665 670 

Leu Lys Glu Leu Glu Arg Lys Arg Leu Glu Gin He Gin Lys Lys Lys 

675 680 685 

Leu Glu Asp Glu Ala Ala Arg Lys Ala Lys Gin Gly Lys Glu Asn Leu 

690 695 700 

Trp Lys Glu Ser He Arg Lys Glu Glu Glu Glu Lys Gin Lys Arg Leu 
705 710 715 720 

Gin Glu Glu Lys Ser Gin Asp Arg Thr Gin Glu Glu Glu Arg Lys Thr 

725 730 735 

Glu Ala Lys Gin Ser Glu Thr Ala Arg Ala Leu Val Asn Tyr Arg Ala 

740 745 750 

Leu Tyr Pro Phe Glu Ala Arg Asn His Asp Glu Met Ser Phe Asn Ser 

755 760 765 

Gly Asp He He Gin Val Asp Glu Lys Thr Val Gly Glu Pro Glv Tro 

770 775 780 

Leu Tyr Gly Ser Phe Gin Gly Lys Phe Gly Trp Phe Pro Cys Asn Tvr 
785 790 795 800 

Val Glu Lys Met Leu Ser Ser Asp Lys Thr Pro Ser Pro Lys Lys Ala 

805 810 815 

Leu Leu Pro Pro Ala Val Ser Leu Ser Ala Thr Ser Ala Ala Pro Gin 

820 825 830 

Pro Leu Cys Ser Asn Gin Pro Ala Pro Val Thr Asp Tyr Gin Asn Val 

835 840 845 

Ser Phe Ser Asn Leu Asn Val Asn Thr Thr Trp Gin Gin LVs Ser Ala 

850 855 860 

Phe Thr Arg Thr Val Ser Pro Gly Ser Val Ser Pro He His Glv Gin 
S65 870 875 880 

Gly Gin Ala Val Glu Asn Leu Lys Ala Gin Ala Leu Cys Ser Trp Thr 

885 890 895 

Ala Lys Lys Glu Asn His Leu Asn Phe Ser Lys His Asp Val He Thr 

900 905 910 

Val Leu Glu Gin Gin Glu Asn Trp Trp Phe Gly Glu Val His Gly Gly 

915 920 925 

Arg Gly Trp Phe Pro Lys Ser Tyr Val Lys He He Pro Gly Ser Glu 

930 935 940 

Val Lys Arg Gly Glu Pro Glu Ala Leu Tyr Ala Ala Val Asn Lys Lys 
945 950 955 960 

Pro Thr Ser Thr Ala Tyr Pro Val Gly Glu Glu Tyr He Ala Leu Tyr 
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965 970 975 

Ser Tyr Ser Ser Val Glu Pro Gly Asp Leu Thr Phe Thr Glu Gly Glu 

980 985 990 

Glu Leu Leu Val Thr Gin Lys Asp Gly Glu Trp Trp Thr Gly Ser He 

995 1000 1005 

Gly Glu Arg Thr Gly lie Phe Pro Ser Asn Tyr Val Arg Pro Lvs Asd 

1010 1015 1020 

Gin Glu Asn Val Gly Asn Ala Ser Lys Ser Gly Ala Ser Asn Lvs Lvs 
1025 1030 1035 1^40 

Pro Glu He Ala Gin Val Thr Ser Ala Tyr Ala Ala Ser Gly Ala Glu 

^ 1045 1050 1055 

Gin Leu Ser Leu Ala Pro Gly Gin Leu He Leu He Leu Lys Lys Asn 

1060 1065 1070 

Ser Ser Gly Trp Trp Gin Gly Glu Leu Gin Ala Arg Gly Lys Lys Ara 
1075 1080 1085 ^ ^ 

Gin Lys Gly Trp Phe Pro Ala Ser His Val Lys Leu Leu Gly Pro Ser 

1090 1095 1100 

Ala Glu Arg Thr Thr Pro Ala Phe His Ala Val Cys Gin Val lie Ala 
1105 1110 1115 1120 

Met Tyr Asp Tyr He Ala Asn Asn Glu Asp Glu Leu Asn Phe Ser Lys 

1125 1130 1135 

Gly Gin Leu He Asn Val Met Asn Lys Asp Asp Pro Asp Trp Trp Gin 

1140 1145 1150 

Gly Glu He Asn Gly Val Thr Gly Leu Phe Pro Ser Asn Tyr Val Lys 

1155 1160 1165 

Met Thr Thr Asp Ser Asp Pro Ser Gin Gin Trp Cys Ala Asp Leu Gin 

1170 1175 1180 

Ala Leu Asp Thr Met Gin Pro Met Glu Arg Lys Arg Gin Gly Tyr He 
1185 1190 1195 1200 

His Glu Leu He Glu Thr Glu Glu Arg Tyr Met Asp Asp Leu Gin Leu 

1205 1210 1215 

Val He Glu Val Phe Gin Lys Arg Met Ala Glu Ser Gly Phe Leu Thr 

1220 1225 1230 

Glu Ala Glu Met Ala Leu He Phe Val Asn Trp Lys Glu Leu He Met 

1235 1240 1245 

Ser Asn Thr Lys Leu Leu Lys Ala Leu Arg Val Arg Lys Lys Thr Gly 

1250 1255 1260 

Gly Glu Lys Met Pro Val Glu Met Met Gly Asp He Leu Ala Ala Glu 
1265 1270 1275 1280 

Leu Ser His Met Gin Ala Tyr He Arg Phe Cys Ser Cys Gin Leu Asn 

1285 1290 1295 

Gly Ala Ala Leu Leu Gin Gin Lys Thr Asp Glu Asp Ala Asp Phe Lys 

1300 1305 1310 

Glu Phe Leu Lys Lys Leu Ala Ser Asp Pro Arg Cys Lys Gly Met Pro 

1315 1320 1325 

Leu Ser Ser Phe Leu Leu Lys Pro Met Gin Arg He Thr Arg Tyr Pro 

1330 1335 1340 

Leu Leu He Arg Ser He Leu Glu Asn Thr Pro Gin Asn His VAl Asp 
1345 1350 1355 1360 

His Ser Ser Leu Lys Leu Ala Leu Glu Arg Ala Glu Glu Leu Cys Ser 

1365 1370 1375 

Gin Val Asn Glu Gly Val Arg Glu Lys Glu Asn Ser Asp Arg Leu Glu 

1380 1385 1390 

Trp He Gin Ala His Val Gin Cys Glu Gly Leu Ala Glu Gin Leu He 

1395 1400 1405 

Phe Asn Ser Leu Thr Asn Cys Leu Gly Pro Arg Lys Leu Leu Tyr Ser 

1410 1415 1420 

Gly Lys Leu Tyr Lys Thr Lys Ser Asn Lys Glu Leu His Gly Phe Leu 
1425 1430 1435 ^440 

Phe Asn Asp Phe Leu Leu Leu Thr Tyr Leu Val Arg Gin Phe Ala Ala 

1445 1450 1455 

Ser Ser Gly Phe Glu Lys Leu Phe Ser Ser Lys Ser Ser Ala Gin Phe 
1460 1465 1470 
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Lys Met Tyr Lys Thr Pro He Phe Leu Asn Glu Val Leu Val Lys Leu 

1475 1480 1485 

Pro Thr Asp Pro Ser Ser Asp Glu Pro Val Phe His He Ser His He 

1490 1495 1500 

Asp Arg Val Tyr Thr Leu Arg Thr Asp Asn He Asn Glu Arg Thr Ala 
1505 1510 1515 1520 

Trp Val Gin Lys He Lys Ala Ala Ser Glu Gin Tyr He Asp Thr Glu 

1525 1530 1535 

Lys Lys Lys Arg Glu Lys Ala Tyr Gin Ala Arg Ser Gin Lys Thr Ser 

1540 1545 1550 

Gly He Gly Arg Leu Met Val His Val He Glu Ala Thr Glu Leu Lvs 

1555 1560 1565 

Ala Cys Lys Pro Asn Gly Lys Ser Asn Pro Tyr Cys Glu He Ser Met 

1570 1575 1580 

Gly Ser Gin Ser Tyr Thr Thr Arg Thr Leu Gin Asp Thr Leu Asn Pro 
^585 1590 1595 leoo 

Lys Trp Asn Phe Asn Cys Gin Phe Phe He Lys Asp Leu Tyr Gin Asp 

1605 1610 1615 

Val Leu Cys Leu Thr Met Phe Asp Arg Asp Gin Phe Ser Pro Asp Asp 

1620 1625 1630 

Phe Leu Gly Arg Thr Glu Val Pro Val Ala Lys He Arg Thr Glu Gin 

1635 1640 1645 

Glu Ser Lys Gly Pro Thr Thr Arg Arg Leu Leu Leu His Glu Val Pro 

1650 1655 1660 

Thr Gly Glu Val Trp Val Arg Phe Asp Leu Gin Leu Phe Glu Gin Lvs 
1665 1670 1675 i680 

Thr Leu Leu 
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